#

gemma

Here are 165 public repositories matching this topic...

ollama / ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.

go golang llama gemma mistral llm llms llava llama2 ollama llama3 phi3 gemma2

Updated Dec 5, 2024
Go

mudler / LocalAI

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more models architectures. Features: Generate Text, Audio, Video, Images, Voice Cloning, Distributed, P2P inference

Updated Dec 4, 2024
Go

unsloth

unslothai / unsloth

Finetune Llama 3.2, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 80% less memory

ai llama lora gemma mistral fine-tuning finetuning llm llms qlora unsloth llama3 phi3 gemma2

Updated Dec 4, 2024
Python

GaiZhenbiao / ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

spark chatbot gemini llama minimax moss gemma claude ernie midjourney chatgpt-api chatglm stablelm ollama qwen dalle3 inspurai

Updated Dec 4, 2024
Python

yangjianxin1 / Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Updated Oct 24, 2024
Python

xorbitsai / inference

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.

Updated Dec 3, 2024
Python

LostRuins / koboldcpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

llama language-model gemma mistral koboldai llm llamacpp ggml koboldcpp gguf

Updated Dec 4, 2024
C++

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

google pytorch gemma

Updated Jul 31, 2024
Python

elia

darrenburns / elia

A snappy, keyboard-centric terminal user interface for interacting with large language models. Chat with ChatGPT, Claude, Llama 3, Phi 3, Mistral, Gemma and more.

python terminal ai tui llama gpt gemma mistral claude large-language-models llm chatgpt ollama ollama-interface ollama-client mixtral mistral-ai llama3 phi-3

Updated Oct 10, 2024
Python

google / generative-ai-docs

Documentation for Google's Gen AI site - including the Gemini API and Gemma

documentation machine-learning ai chatbot embeddings gemini gemma gemini-api llm

Updated Dec 2, 2024
Jupyter Notebook

nextjs-ollama-llm-ui

jakobhoeg / nextjs-ollama-llm-ui

Fully-featured, beautiful web interface for Ollama LLMs - built with NextJS. Deploy with a single click.

react typescript ai local offline nextjs chatbot localstorage openai gemma mistral tailwindcss llm shadcn ollama mistral-7b nextjs14

Updated Oct 31, 2024
TypeScript

gemma-cookbook

google-gemini / gemma-cookbook

A collection of guides and examples for the Gemma open models from Google.

gemma codegemma paligemma recurrentgemma

Updated Dec 4, 2024
Jupyter Notebook

mlc-ai / web-llm-chat

Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

chat privacy ai nextjs chatbot llama hermes chat-application gemma webgpu mistral phi2 large-language-models llm generative-ai chatgpt redpajama qwen tinyllama

Updated Nov 13, 2024
TypeScript

magpie-align / magpie

Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!

nlp paper dataset alignment gemma synthetic-data synthetic-dataset-generation llm supervised-finetuning llama2 qwen2 llama3 phi3

Updated Nov 5, 2024
Python

aikit

sozercan / aikit

🏗️ Fine-tune, build, and deploy open-source LLMs easily!

Updated Dec 4, 2024
Go

papersgpt / papersgpt-for-zotero

Zotero AI plugin chatting papers with ChatGPT, Gemini, Claude, Llama 3.2, QwQ-32B-Preview, Marco-o1, Gemma, Mistral and Phi-3.5

ai paper gemini summary zotero llama gemma mistral claude zotero-plugin chatgpt phi-3 marco-o1 qwq-32b-preview

Updated Dec 3, 2024
JavaScript

Beomi / InfiniTransformer

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

transformers pytorch llama gemma huggingface infinitransformer llama3

Updated Apr 23, 2024
Python

InternLM / InternEvo

InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencies.

pytorch multi-modal gemma pipeline-parallelism transformers-models tensor-parallelism llava llm-training internlm flash-attention zero3 llm-framework sequence-parallelism internlm2 ring-attention deepspeed-ulysses llama3 910b

Updated Dec 4, 2024
Python

AI-Hypercomputer / JetStream

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

gpu inference pytorch transformer llama gpt gemma model-serving tpu jax mlops large-language-models llm llmops llm-inference llama2

Updated Nov 19, 2024
Python

inferflow / inferflow

Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).

bloom falcon moe gemma mistral mixture-of-experts model-quantization multi-gpu-inference m2m100 llamacpp llm-inference internlm llama2 qwen baichuan2 mixtral phi-2 deepseek minicpm

Updated Mar 15, 2024
C++

Improve this page

Add a description, image, and links to the gemma topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the gemma topic, visit your repo's landing page and select "manage topics."