zihangdai / Xlnet - XLNet: Generalized Autoregressive Pretraining for Language Understanding
salesforce / BLIP - PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
deepseek-ai / DeepSeek VL2 - DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
CLUEbenchmark / CLUE - Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
deepseek-ai / DeepSeek VL - DeepSeek-VL: Towards Real-World Vision-Language Understanding
DAMO-NLP-SG / Video LLaMA - [EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
X-PLUG / MPLUG DocOwl - mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
openai / Finetune Transformer Lm - Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
namisan / Mt Dnn - Multi-Task Deep Neural Networks for Natural Language Understanding
alfredfrancis / AI Chatbot Framework - A Python chatbot framework with Natural Language Understanding and Artificial Intelligence.
ChineseGLUE / ChineseGLUE - Language Understanding Evaluation benchmark for Chinese: datasets, baselines, pre-trained models, corpus and leaderboard
DjangoPeng / Openai Quickstart - A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
hendrycks / Test - Measuring Massive Multitask Language Understanding | ICLR 2021
ByteDance-Seed / Seed1.5 VL - Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
stepfun-ai / Step Audio2 - Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
MoonshotAI / Kimi VL - Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
NVIDIA / Audio Flamingo - PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
brightmart / Bert Language Understanding - Pre-training of Deep Bidirectional Transformers for Language Understanding: pre-train TextCNN
PKU-YuanGroup / Chat UniVi - [CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
CBLUEbenchmark / CBLUE - [CBLUE1] Chinese Medical Information Processing Benchmark CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark