LLaVA-NeXT-Video Model Card Check out also the Google Colab demo to run Llava on a free-tier Google
410 llava_next
LLaVA-NeXT-Video Model Card Check out also the Google Colab demo to run Llava on a free-tier Google
630 llava_next
LLaVA-NeXT-Video Model Card Check out also the Google Colab demo to run Llava on a free-tier Google
460
LLaVa-Next, leveraging NousResearch/Nous-Hermes-2-Yi-34B as LLM The LLaVA-NeXT model was proposed in
650 llava_next vision
WhisperSpeech If you have questions or you want to help you can find us in the #audio-generation
330 pytorch text-to-speech
LLaVa-Next, leveraging liuhaotian/llava-v1.6-vicuna-13b as LLM The LLaVA-NeXT model was proposed in
920 llava_next vision
LLaVa-Next, leveraging liuhaotian/llava-v1.6-vicuna-7b as LLM The LLaVA-NeXT model was proposed in L
600 llava_next vision
LLaVa-Next, leveraging mistralai/Mistral-7B-Instruct-v0.2 as LLM The LLaVA-NeXT model was proposed i
420 llava_next vision
LLaVA Model Card Below is the model card of Llava model 13b, which is copied from the original Llav
730 llava
Parler-TTS Mini v0.1 Fine-tuning guide on Colab: Parler-TTS Mini v0.1 is a lightweight
360
AuraSR GAN-based Super-Resolution for upscaling generated images, a variation of the GigaGAN paper
430 pytorch art
MARS5: A novel speech model for insane prosody. This is the repo for the MARS5 English speech model
610 pytorch text-to-speech
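BeautifulPrompt Brief Introduction We have open-sourced an automatic prompt generation model: you can enter an extremely simple prompt and get back a prompt that has been optimized by a language model
480 pytorch pytorch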
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understand
580 pytorch
MPP-Qwen-Next: Multimodal Pipeline Parallel based on QwenLM
340
This is the first in a series of models designed to replicate the prose quality of the Claude 3 mode
380 pytorch chat
SD3 Controlnet [Image table columns: control image, weight=0.0, weight=0.3, weight=0.5, weight=0.7, weight=0.9]
450 pytorch
⚡ Flash Diffusion: FlashSD3 ⚡ Flash Diffusion is a diffusion distillation method proposed in Flash D
560 pytorch
sdxl-emoji LoRA by fofr An SDXL fine-tune based on Apple Emojis > Inference with Replicate API Grab
530 pytorch text-to-image
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering We introd
480 pytorch
162,895 projects in total