Advancing Open Source LLMs with Mixed Quality Data through offline RL-inspired C-RLFT.
Project Lead: Guan Wang, @AlpayAriyak
huggingface.co/openchat
Joined July 2023
Will Sudoku become the MNIST for reasoning?
Simple rules, clear structure, unique solutions—yet surprisingly challenging for modern LLMs, often requiring explicit trial-and-error to solve.
huggingface.co/datasets/sapie…
🚀Introducing Hierarchical Reasoning Model🧠🤖
Inspired by brain's hierarchical processing, HRM delivers unprecedented reasoning power on complex tasks like ARC-AGI and expert-level Sudoku using just 1k examples, no pretraining or CoT!
Unlock next AI breakthrough with neuroscience. 🌟
📄Paper: arxiv.org/abs/2506.21734
💻Code: github.com/sapientinc/HRM
🚨Recursive Skip-Step Planning (RSP)
Relying on larger, expressive models for sequential decision-making has recently become a popular choice, but are they truly necessary? Can we replace these heavy models? Yes—RSP empowers shallow MLPs to excel in long-horizon tasks!🧵(1/n)
🚀Excited to share our Storm-7B🌪️. This model achieves a 50.5% length-controlled win rate against GPT-4 Preview, making it the first open-source model to match GPT-4 Preview on AlpacaEval 2.0.
📄arxiv.org/pdf/2406.11817
🤗huggingface.co/jieliu/Storm-7B
4) This is why I am embarking on a journey to explore new frontiers in AI, specifically targeting the current limitations of GPTs in Planning and Reasoning.
3) However, while training these new models, I can't help but realize the upper limit of what autoregressive models can do. They struggle to solve complex tasks such as software engineering, advanced mathematics, and creating super assistants. It is mathematically challenging for GPT models to efficiently and effectively decompose and plan for the multistep, deterministic actions necessary for AGI.
🚀Introducing OpenChat 3.6
🌟Surpassed official Llama3-Instruct—with 1-2M synthetic data compared to ~10M human labels
🤫GPTs are close to limits—excel at generation but fall short at complex tasks
🎯We are training next gen—capable of deterministic reasoning and planning
🔗 Explore OpenChat-3.6 (20240522 Llama 3 Version):
HuggingFace: huggingface.co/openchat/openc…
Live Demo: openchat.team
GitHub: github.com/imoneoi/opench…
🚀 The World's First Gemma fine-tune based on openchat-3.5-0106 data and method (C-RLFT). Almost the same performance as the Mistral-based version.
6T tokens = secret recipe?
HuggingFace: huggingface.co/openchat/openc…
🚀Announcing OpenChat-3.5 Update 0106: 𝗪𝗼𝗿𝗹𝗱’𝘀 𝗕𝗲𝘀𝘁 𝗢𝗽𝗲𝗻 𝗦𝗼𝘂𝗿𝗰𝗲 𝟳𝗕 𝗟𝗟𝗠!
Experience ChatGPT & Grok-level AI locally 💿!
Surpassing Grok-0 (33B) across all 4 benchmarks and Grok-1 (???B) on average and 3/4 benchmarks 🔥.
🎯 This update mainly enhanced training methodology, in-context learning & coding skills, outperforming the last 1210 release on 7 out of 8 benchmarks!
🌐 The model is available on our live demo, HuggingFace, and GitHub:
HuggingFace: huggingface.co/openchat/openc…
Live Demo: openchat.team
GitHub: github.com/imoneoi/opench…
🛠️ To deploy it yourself, visit our GitHub (github.com/imoneoi/opench…) for full instructions to serve OpenChat models with an accelerated vLLM backend, API key authentication, and more!