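Large Models (12 items)
Large Model Insider
1 ChatGPT Can Now See, and It Is a Real Eye-Opener
2 Like It or Not, OpenAI's Text-to-Image Model, Backed by ChatGPT, Has Crushed Its Rivals Again
3 Miaoya Goes "Free": It Doesn't Want to Be Just an AI Portrait App That Occasionally Goes Viral
4 Has Someone Already Built GPT-5 for OpenAI?
5 Is arXiv, the Hottest Paper Platform in AI, Becoming an Academic Cancer?
6 ChatGPT Traffic Has Fallen for Three Straight Months: What's Going On?
7 Talk Transcript from Allen, Chief Architect of AI&HPC Application Software at Inspur Information: The Way of Compute in the Large-Model Era
8 Over the Past Month, Who Took the Money Invested in AI Companies?
9 When It Comes to Building Large Models, Tencent Will Not Become a Startup
10 Chinese Large Models Cost More to Run Than English Ones, and the Underlying Principles of AI Are the Reason?
11 ChatGPT Enterprise Launches with a Bang! Unlimited Access, Twice the Speed, 32,000 Tokens... OpenAI Starts "Grabbing Money"
12 Google's New Model Already Has 5x the Compute of GPT-4: Will Brute Force Let It Overtake OpenAI?
13 Closing In Fast on ChatGPT! Llama's Latest Code Generation Model Is Right Behind GPT-4
14 Using a 4K Restoration of "King of Beggars" (《武状元苏乞儿》) to Repay the "Debts" of the Film-Stock Era
15 Standing Firmly with the Wave (浪潮)
16 Still Worried You Can't Help Your Kids with Math? The AI Tutor MathGPT Is Here!
17 NVIDIA's Earnings Soar: It Seems Only Jensen Huang Can Beat Jensen Huang!
18 AI Has No Consciousness Now, but That Doesn't Mean It Never Will! New Paper from Turing Award Winner Bengio: Technology Is No Longer the Obstacle
19 OpenAI Stops Holding Back and Opens Up Fine-Tuning: Build Your Own ChatGPT Without Any Other Tools
20 Large Model Evening News | ByteDance Launches AI Chat Product Doubao, Now Open for Testing
21 ByteDance Throws a Doubao into the Large-Model Melee
22 OpenAI's First Acquisition Points to One Thing: They Are Going to Build Applications
23 In the Large-Model Race, iFlytek Spark Is Ready
24 Large Model Evening News | OpenAI Announces Its First Public Acquisition, Buying a Game Startup
25 Large Model Evening News | DingTalk Personal Edition to Add an AI Text Generation Model, Beta Applications Now Open
26 Large Model Evening News | Bing Chat Has Quality Problems; Microsoft Says It Will Improve the User Experience
27 A Pixel-Art Westworld Has Arrived, and Its AI Residents Have Even Started Reading "1984"
28 Large Model Evening News | ChatGPT Costs About $700,000 a Day; OpenAI May Be on the Brink of Bankruptcy
29 What Is "e/acc", the New Thing Silicon Valley Bigwigs Are All Chasing?
30 Who Exactly Is the GPU Bottleneck Holding Back?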
26 minutes ago
1 AI Agents Are Here. What Now?
2 Visual Document Retrieval Goes Multilingual
3 CO₂ Emissions and Models Performance: Insights from the Open LLM Leaderboard
4 Introducing smolagents: simple agents that write actions in code.
5 Visualize and understand GPU memory in PyTorch
6 Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo
7 Evaluating Audio Reasoning with Big Bench Audio
8 Finally, a Replacement for BERT: Introducing ModernBERT
9 Bamba: Inference-Efficient Hybrid Mamba2 Model
10 Benchmarking Language Model Performance on 5th Gen Xeon at GCP
11 Welcome the Falcon 3 Family of Open Models!
12 Introducing the Synthetic Data Generator - Build Datasets with Natural Language
13 LeMaterial: an open source initiative to accelerate materials discovery and research
14 Open Preference Dataset for Text-to-Image Generation by the 🤗 Community
15 Hugging Face models in Amazon Bedrock
16 How good are LLMs at fixing their mistakes? A chatbot arena experiment with Keras and TPUs
17 Welcome PaliGemma 2 – New vision language models by Google
18 Rethinking LLM Evaluation with 3C3H: AraGen Benchmark and Leaderboard
19 Investing in Performance: Fine-tune small models with LLM insights - a CFM case study
20 Open Source Developers Guide to the EU AI Act
21 SmolVLM - small yet mighty Vision Language Model
22 Rearchitecting Hugging Face Uploads and Downloads
23 You could have designed state of the art positional encoding
24 Introduction to the Open Leaderboard for Japanese LLMs
25 Faster Text Generation with Self-Speculative Decoding
26 From Files to Chunks: Improving Hugging Face Storage Efficiency
27 Letting Large Models Debate: The First Multilingual LLM Debate Competition
28 Judge Arena: Benchmarking LLMs as Evaluators
29 Share your open ML datasets on Hugging Face Hub!
30 Hugging Face + PyCharm
31 Argilla 2.4: Easily Build Fine-Tuning and Evaluation datasets on the Hub — No Code Required
32 Universal Assisted Generation: Faster Decoding with Any Assistant Model
33 Expert Support case study: Bolstering a RAG app with LLM-as-a-Judge
34 A Deepdive into Aya Expanse: Advancing the Frontier of Multilinguality
35 CinePile 2.0 - making stronger datasets with adversarial refinement
36 Introducing HUGS - Scale your AI with Open Models
37 Introducing SynthID Text
38 Deploying Speech-to-Speech on Hugging Face
39 Releasing Outlines-core 0.1.0: structured generation in Rust and Python
40 🧨 Diffusers welcomes Stable Diffusion 3.5 Large
41 Transformers.js v3: WebGPU support, new models & tasks, and more…
42 Hugging Face Teams Up with Protect AI: Enhancing Model Security for the Community
43 Llama 3.2 in Keras
44 Fixing Gradient Accumulation
45 A Security Review of Gradio 5
46 Introducing the AMD 5th Gen EPYC™ CPU
47 Welcome, Gradio 5
48 Scaling AI-based Data Processing with Hugging Face + Dask
49 Faster Assisted Generation with Dynamic Speculation
50 Improving Parquet Dedupe on Hugging Face Hub
51 Introducing the Open FinLLM Leaderboard
52 A Short Summary of Chinese AI Global Expansion
53 🇨🇿 BenCzechMark - Can your LLM Understand Czech?
54 Converting Vertex-Colored Meshes to Textured Meshes
55 Llama can now see and run on your device - welcome Llama 3.2
56 Exploring the Daily Papers Page on Hugging Face
57 FineVideo: behind the scenes
58 Optimize and deploy models with Optimum-Intel and OpenVINO GenAI
59 Fine-tuning LLMs to 1.58bit: extreme quantization made easy
60 Introducing the SQL Console on Datasets
61 Introducing Community Tools on HuggingChat
62 Accelerate 1.0.0
63 Hugging Face partners with TruffleHog to Scan for Secrets
64 Scaling robotics datasets with video encoding
65 The 5 Most Under-Rated Tools on Hugging Face
66 Improving Hugging Face Training Efficiency Through Packing with Flash Attention
67 Deploy Meta Llama 3.1 405B on Google Cloud Vertex AI
68 A failed experiment: Infini-Attention, and why we should keep trying?
69 Introduction to ggml
70 Tool Use, Unified
71 Welcome FalconMamba: The first strong attention-free 7B model
72 XetHub is joining Hugging Face!
73 Introducing TextImage Augmentation for Document Images
74 2024 Security Feature Highlights
75 Google releases Gemma 2 2B, ShieldGemma and Gemma Scope
76 Memory-efficient Diffusion Transformers with Quanto and Diffusers
77 Serverless Inference with Hugging Face and NVIDIA NIMs
78 LAVE: Zero-shot VQA Evaluation on Docmatix with LLMs - Do We Still Need Fine-Tuning?
79 Llama 3.1 - 405B, 70B & 8B with multilinguality and long context
80 WWDC 24: Running Mistral 7B with Core ML
81 Docmatix - a huge dataset for Document Visual Question Answering
82 TGI Multi-LoRA: Deploy Once, Serve 30 Models
83 How we leveraged distilabel to create an Argilla 2.0 Chatbot
84 SmolLM - blazingly fast and remarkably powerful
85 How NuminaMath Won the 1st AIMO Progress Prize
86 Preference Optimization for Vision Language Models
87 Experimenting with Automatic PII Detection on the Hub using Presidio
88 Announcing New Hugging Face and KerasHub integration
89 Banque des Territoires (CDC Group) x Polyconseil x Hugging Face: Enhancing a Major French Environmental Program with a Sovereign Data Solution
90 Google Cloud TPUs made available to Hugging Face users
91 Announcing New Dataset Search Features
92 Accelerating Protein Language Model ProtST on Intel Gaudi 2
93 Our Transformers Code Agent beats the GAIA benchmark!
94 Welcome Gemma 2 - Google's new open LLM
95 XLSCOUT Unveils ParaEmbed 2.0: a Powerful Embedding Model Tailored for Patents and IP with Expert Support from Hugging Face
96 Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality
97 Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models
98 Data Is Better Together: A Look Back and Forward
99 Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap
100 BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks
10 minutes ago
Most Downloaded Models
1 microsoft/resnet-50 Image Classification
2 sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity
3 google-bert/bert-base-uncased Fill-Mask
4 openai/clip-vit-large-patch14 Zero-Shot Image Classification
5 timm/mobilenetv3_small_100.lamb_in1k Image Classification
6 ByteDance/AnimateDiff-Lightning Text-to-Video
7 timm/resnet50.a1_in1k Image Classification
8 sentence-transformers/all-mpnet-base-v2 Sentence Similarity
9 FacebookAI/roberta-base Fill-Mask
10 openai/clip-vit-base-patch32 Zero-Shot Image Classification
11 jonatasgrosman/wav2vec2-large-xlsr-53-english Automatic Speech Recognition
12 FacebookAI/xlm-roberta-base Fill-Mask
13 distributed/optimized-gpt2-2b Text Generation
14 microsoft/codebert-base Feature Extraction
15 openai/clip-vit-base-patch16 Zero-Shot Image Classification
16 distilbert/distilbert-base-uncased Fill-Mask
17 openai/whisper-small Automatic Speech Recognition
18 pyannote/wespeaker-voxceleb-resnet34-LM
19 pyannote/segmentation-3.0 Voice Activity Detection
20 FacebookAI/xlm-roberta-large Fill-Mask
21 FacebookAI/roberta-large Fill-Mask
22 openai-community/gpt2 Text Generation
23 yikuan8/Clinical-Longformer Fill-Mask
24 sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 Sentence Similarity
25 CIDAS/clipseg-rd64-refined Image Segmentation
26 facebook/dinov2-base Image Feature Extraction
27 pyannote/speaker-diarization-3.1 Automatic Speech Recognition
28 google/electra-base-discriminator
29 facebook/esm2_t30_150M_UR50D Fill-Mask
30 distilbert/distilbert-base-uncased-finetuned-sst-2-english Text Classification
10 minutes ago