So Kuroki @sharp_computer
Researcher @SakanaAILabs. LLM and user interface. sites.google.com/view/sokuroki Joined August 2016-
Tweets106
-
Followers642
-
Following678
-
Likes4K
Today, we are launching our first commercial product: Sakana Marlin, Your Virtual CSO. Marlin is an autonomous research assistant for business, built around hours of long-horizon reasoning. Try Marlin: sakana.ai/marlin Blog: sakana.ai/marlin-release… You provide a research topic, and that is the only input required. From there, Sakana Marlin works autonomously for up to roughly 8 hours. It forms hypotheses, gathers information, and verifies its own findings as it works through a vast body of material. It returns a structured set of summary slides and a research report dozens of pages long. It is designed to take on the kind of deep strategy work that a CSO and a small team might otherwise spend weeks on. Unlike instant chat or general-purpose deep research, Sakana Marlin executes a long reasoning process that unfolds over 8 hours. Underpinning it is the research we have pursued over the past two years into long-horizon reasoning and AB-MCTS, our method for coordinating multiple models to reason more effectively together. But Sakana Marlin did not come from the lab alone. It grew out of our work deploying AI agents across real industries in Japan, making it the direct product of what we have learned in both research and the field. Marlin is available today, offering a pay-per-use tier with no monthly fees, alongside Pro, Team, and Enterprise plans. It is also the first of many products to come from Sakana AI, stay tuned!
🐠 Sakana Marlin リリース 🐠 本日、Sakana AIは、初の商用プロダクトとして、ビジネス向けの自律型リサーチアシスタント「Sakana Marlin」を提供開始しました。 詳細はこちら:sakana.ai/marlin ブログ:sakana.ai/marlin-release
🐠 Sakana Marlin リリース 🐠 本日、Sakana AIは、初の商用プロダクトとして、ビジネス向けの自律型リサーチアシスタント「Sakana Marlin」を提供開始しました。 詳細はこちら:sakana.ai/marlin ブログ:sakana.ai/marlin-release
Today, we are officially launching the Sakana AI RSI Lab in Tokyo to build open-ended, adaptive AI systems that collectively self-improve. I am incredibly proud of our team’s work over the past 2 years, shipping the breakthrough research that laid the foundations for this moment. Building in Japan provides us with the ultimate design constraint. Just like Japan’s historical dominance in manufacturing was achieved by fundamentally redesigning the factory floor to do more with less, we are focused on compute-efficiency. We are not building the most compute-hungry self-improvement engine. We are building the most sample-efficient one. If you are entirely unsatisfied with the brute-force status quo and ready to build the self-improving future in Japan, come join us.
Building AI that Builds AI: Introducing the Sakana AI RSI Lab 🚀 sakana.ai/rsi-lab Today, we are announcing the Sakana AI Recursive Self-Improvement (RSI) Lab: a dedicated research group in Tokyo tasked with redesigning the AI development process itself using AI. While
Building AI that Builds AI: Introducing the Sakana AI RSI Lab 🚀 sakana.ai/rsi-lab Today, we are announcing the Sakana AI Recursive Self-Improvement (RSI) Lab: a dedicated research group in Tokyo tasked with redesigning the AI development process itself using AI. While the industry increasingly speculates about the theoretical potential of self-improving AI, we’ve spent the last two years actively laying the foundations to make it a reality: ▪ LLM²: AI models automating research to invent better preference optimization algorithms. ▪ Darwin Gödel Machine: Agents autonomously rewriting their own codebase to double software-engineering performance. ▪ ShinkaEvolve: Hyper-sample-efficient program evolution that builds novel loss functions for MoE models. ▪ ALE-Agent: Reinforcement agents outperforming hundreds of human experts via self-learning. ▪ Digital Red Queen: Open-ended adversarial coevolution laying the groundwork for RSI in cybersecurity. ▪ The AI Scientist: Towards end-to-end automation of AI research, recently published in Nature. Now, we are unifying these breakthroughs. The Sakana AI RSI Lab is officially tasked with building open-ended, adaptive architectures that collectively self-improve. Human intelligence did not emerge from limitless resources; it was forged through the open-ended, compounding process of evolution operating under strict constraints. We are applying this exact principle to AI. We believe recursive self-improvement is achievable on modest, sample-efficient compute. It shouldn’t be a winner-take-all asset locked inside hyperscale clusters, but a democratized public good. We’re scaling our team to execute this mission. We are looking for frontier scientists and engineers who are entirely unsatisfied with the brute-force status quo. If you are ready to break away from standard benchmarking and build the self-improving future in Japan, come build with us.
Several people asked about the compute required for this research. Training takes about half a day on 2-3 GPUs. Ofc we used more compute during research and experimentation, but each iteration itself is lightweight. If you can SFT a 7B model (like Moshi), you can try it too.
KAME🐢 will be presented at tomorrow’s final session! Please stop by before you leave. 5/8 2:00–4:00 PM Poster Area 30.4 #ICASSP
We’re excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI, accepted at #ICASSP2026! 🐢 Blog pub.sakana.ai/kame/ Paper arxiv.org/abs/2510.02327 Can a speech AI think deeply without pausing to process? In real
#ICASSP 我々のKAME🐢論文 (SLP-P56.4)は、本日の現地時間14:00からPoster Area 30で発表されます。学習データの構成法からモデル学習の実際まで、なんでも聞いてください。私も発表者の近くにいるハズなので、よろしくお願いします。一応ヘッドホン持って行きますが、スペース十分にあるかな?
For the past few years, humans have been doing “prompt engineering” to coax the best performance out of different LLMs. In this work, we explored what happens if we train an AI to do that job instead. By training a Conductor model with RL, we found that it naturally learns to write highly effective, custom instructions for a whole pool of other models. It essentially learns to ‘manage’ them in natural language. What surprised me most was how it dynamically adapts. For simple factual questions, it just queries one model. But for hard coding problems, it autonomously spins up a whole pipeline of planners, coders, and verifiers. Really excited to see where this paradigm of “AI managing AI” goes next, especially as we start moving from single-agent chain-of-thought to multi-agent “chain-of-command”. Link to our #ICLR2026 paper: arxiv.org/abs/2512.04388 Along with our TRINITY paper which we announced earlier, this work also powers our new multi-agent system: Sakana Fugu (sakana.ai/fugu-beta) 🐡
Introducing our new work: “Learning to Orchestrate Agents in Natural Language with the Conductor” accepted at #ICLR2026 arxiv.org/abs/2512.04388 What if we trained an AI not to solve problems directly, but to act as a manager that delegates tasks to a diverse team of other AIs?
#ICML2026 に論文がacceptされました!今度は拡散言語モデルのtest-time scalingの研究です。 複数の拡散言語モデルに協力させるとコーディングや数学の能力が大きく上げられるという研究で、性能も良いしアルゴリズム自体も面白くてお気に入りの研究です。韓国でお会いしましょう🇰🇷
We’re excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI, accepted at #ICASSP2026! 🐢 Blog pub.sakana.ai/kame/ Paper arxiv.org/abs/2510.02327 Can a speech AI think deeply without pausing to process? In real conversation, we don’t wait until we’ve fully worked out what we want to say—we start talking, and our thoughts catch up as the sentence unfolds. Fast speech-to-speech models achieve this, but their reasoning tends to stay shallow. Cascaded pipelines that route through a knowledgeable LLM are smarter, but the added latency breaks the flow—they fall back to "think, then speak." In our new paper, we propose a way to break this trade-off. We call it KAME (Turtle in Japanese). A speech-to-speech model handles the fast response loop and starts replying immediately. In parallel, a backend LLM runs asynchronously, generating response candidates that are continuously injected as "oracle" signals in real time. This shifts the AI paradigm from "think, then speak" to "speak while thinking." The backend LLM is completely swappable. You can plug in GPT-4.1, Claude Opus, or Gemini 2.5 Flash depending on the task without changing the frontend. In our experiments, Claude tended to score higher on reasoning, while GPT did better on humanities questions. Try the model yourself here: huggingface.co/SakanaAI/kame
kameのweightとブログ記事公開だ!!サカーナいつもありがとう Cascade な full-duplex 大好きマンだけどメチャやりたくなる huggingface.co/SakanaAI/kame
We’re excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI, accepted at #ICASSP2026! 🐢 Blog pub.sakana.ai/kame/ Paper arxiv.org/abs/2510.02327 Can a speech AI think deeply without pausing to process? In real
音声AIの素早さと賢さを両立できるか? 私たち人間は会話の中で、言いたいことを全部まとめてから話し始めるのではなく、話しながら考えを整理していきます。応答の速い Speech-to-Speech モデルは、この「話しながら考える」を実現しましたが、そのぶん思考が浅くなりがちです。かといって知識豊富な LLM を挟むカスケード型では、遅延が生じるため「話しながら」が成立しません。 そこで Sakana AI は、このトレードオフを克服するKAMEモデルを開発しました。Speech-to-Speech モデルが高速な応答ループを担当し、即座に話し始めます。その裏でバックエンドの LLM が非同期に推論を進めて応答候補を生成し、それをオラクル信号としてリアルタイムに注入します。これにより「考えてから話す」ではなく「話しながら考える」ことが可能になります。 バックエンドの LLM は差し替えが可能で、タスクに応じてGPT-4.1、Claude Opus、Gemini 2.5 Flashなどを使い分けられます。フロントエンド側の変更は必要ありません。私たちの実験では、Claudeは推論系のタスクで、GPTは人文系のタスクで、それぞれ高いスコアを出す傾向が見られました。 本研究は #ICASSP2026 で発表されます。 ぜひ、お試しください。 ブログ: pub.sakana.ai/kame/ 論文: arxiv.org/abs/2510.02327 モデル: huggingface.co/SakanaAI/kame
We’re excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI, accepted at #ICASSP2026! 🐢 Blog pub.sakana.ai/kame/ Paper arxiv.org/abs/2510.02327 Can a speech AI think deeply without pausing to process? In real
🐢の紹介です。重みとコードが公開されます。是非お試ししてみてください!
We’re excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI, accepted at #ICASSP2026! 🐢 Blog pub.sakana.ai/kame/ Paper arxiv.org/abs/2510.02327 Can a speech AI think deeply without pausing to process? In real
音声対話モデルの研究をしていました!既存のspeech-to-speech (Moshi)と話した時に、その応答の自然さに感動する一方で、もう少しだけ賢くしたいと思ったのが研究のきっかけです。 5月のICASSPで発表します。モデル、コードも公開しているのでぜひ試してみてください!
We’re excited to introduce KAME: Tandem Architecture for Enhancing Knowledge in Real-Time Speech-to-Speech Conversational AI, accepted at #ICASSP2026! 🐢 Blog pub.sakana.ai/kame/ Paper arxiv.org/abs/2510.02327 Can a speech AI think deeply without pausing to process? In real
We’re launching the beta for our new commercial AI product: Sakana Fugu 🐡, a multi-agent orchestration system! Blog: sakana.ai/fugu-beta Fugu hits SOTA on SWE-Pro, GPQA-D, and ALE-Bench, and has been our internal secret weapon. It dynamically coordinates frontier models, autonomously selecting the optimal agent combinations and roles for each task. Available as an OpenAI-compatible API, you can seamlessly integrate Fugu into your existing workflows with minimal changes. 🐟 Fugu Mini: High-speed orchestration optimized for latency 🐡 Fugu Ultra: Full model pool utilization for deep, complex reasoning Apply for the beta test here: forms.gle/BtKkhc2CfLKk1d…
音声系のインターンを募集しています。音声インターフェースを使ったアプリケーションを作ることに興味のある人、誰か一緒に働きませんか?DMください。
サカナAI @SakanaAILabs の伊藤社長と意見交換。防衛大臣直轄の吉田AIチーム長ら職員も参加して非常に有意義な時間になりました。ありがとうございました!
Joined Sakana AI as a Research Intern 🐟 Super excited 🔥
🐟Ultra Deep Researchアシスタント「Sakana Marlin」、βテスター募集🐟 Sakana AIは、当社初の商用プロダクトとして、独自のエージェント技術によるビジネス向けAIリサーチアシスタント「Sakana Marlin」を開発しました。 sakana.ai/marlin-beta Sakana Marlinは、高度なビジネス調査を完遂する 、独自の長期推論技術に基づく自律型リサーチアシスタントです。 主な特徴 ・ テーマを与えると、8時間近くにわたり自律的にリサーチ ・ 詳細な調査ドキュメントとまとめスライドを自動生成 ・ 複数人のチームが数週間かけるプロフェッショナルな戦略調査を想定 複雑な社会情勢の中で良質な判断を下すため、AIのポテンシャルを最大限生かすソリューションとして構想しました。 本技術は、先日Nature誌にも掲載された科学的発見の自動化「AIサイエンティスト」の知見と、戦略的探索を可能にする「AB-MCTS」を融合。長く考えた分だけアウトプットの質が向上する「効率的な推論スケーリング」を実現しています。 クローズドβテストを実施します 金融機関・事業会社の経営戦略/事業企画部門、コンサルファーム、シンクタンクなど、日常的に高度なリサーチに取り組む方が対象です(期間中無料)。皆様からのフィードバックをもとに改善を重ねていきます。 ▼ クローズドβテスター応募はこちら forms.gle/MYHGP1wi2q4PHY…
Sakana Chatの公開です! 今回開発した「Namazu」モデルは、DeepSeek-V3.1等のオープンLLMに事後学習を適用したものです。優れた性能を維持しながら、日本での利用に適した振る舞いをします。Web検索機能についてもよく作り込んでいるので、日常用途には十分実用的だと思います。是非お試し下さい。
🐟 Sakana Chat 公開 🐟 Sakana AIは、Sakana Chatを無料公開しました。 chat.sakana.ai Web検索機能と高速レスポンスを備えたAIチャットです。日本国内から、どなたでもお使いいただけます。ぜひ、お試しください。
ゆきねこ🐾 @vxwh_SLD
226 Followers 846 Following 友人や家族と美味しい料理を味わいながら、充実した時間を過ごすのが大好きです。上質なコーヒーでも、シンプルな食事でも、すべての食事が人生における小さな幸せの瞬間をもたらしてくれます
Paavo @PaavoParmas
101 Followers 296 Following Machine learning and reinforcement learning researcher. エストニア人. Currently @UTokyo_News Prev: @Cambridge_Uni, @OISTedu, @GoogleDeepMind intern, @RIKEN_AIP intern
Arip @machinestein
1K Followers 773 Following
Akash Mahajan @akashmjn
641 Followers 824 Following now 🎧; prev chatting with PDFs @ContextualAI; transcription @Azure Speech; @Stanford @atherenergy @iitmadras
Kazunori Sato @kazunori_279
24K Followers 3K Following Developer Advocate, Cloud AI, Google. (The opinions expressed here by myself are my own, not those of my employer)
matrix @kk3970716697
165 Followers 977 Following
Robert Scoble @Scobleizer
587K Followers 51K Following San Francisco/Silicon Valley AI | Robots, holodecks, BCIs, analysis of new things | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future.
jaladdin @dataintel_
1 Followers 37 Following
t @t7766428555856
147 Followers 3K Following
Ruri Maekawa @seimy_wakuwaku
2K Followers 1K Following 東京大学/QueeenB. inc研究の自動化/3D model 🫀Georgia tech (Organoid)→UT(Spheroid)→UW(Gastruloid) Seattle 🇺🇸
Kentaro Seki / 関健... @trgkpc
3K Followers 2K Following 3rd-year doctoral student @ Univ. Tokyo Saru-lab (JSPS DC1) | audio-language model, speech synthesis, machine learning
Yasufumi Moriya @yasufumi_moriya
143 Followers 267 Following Speech Scientist at @tproworld | PhD from @AdaptCentre @DCU | MSc @EdinburghUni | speech recognition and information retrieval. Seal lover.
Taishi Nakashima (中... @_tai_shi
616 Followers 827 Following Assistant Professor at Tokyo Metropolitan University, 東京都立大学 システムデザイン学部 助教, 音源分離の研究してます🎛️
Shinnosuke Takamichi ... @forthshinji
5K Followers 394 Following Speech researcher / 音声研究者. https://t.co/f8hJL8R1Lm
Junnosuke Kamohara @jnskkmhr1218
340 Followers 897 Following Georgia Tech Robotics PhD, Bipedal locomotion @GT_LIDAR
Yuka Ko @keiouok
2K Followers 2K Following Postdoc at KIT, Karlsruhe / Interest: Speech Translation, Simultaneous Translation / Prev: Osaka Pref. Univ. (Bachelor) → NAIST (Master, Doctor, Postdoc)
zamagi x18【狐大�... @555zamagi
6K Followers 5K Following (nsfw:1.2),センシチブなものしか作らないAI negative:18歳未満 #AIart #AiArtcommunity #oppai #ai狐部 #ケモ耳鑑賞クラブ #AIケモミミ部
Guan-Ting (Daniel) Li... @GTL094144
405 Followers 705 Following Research Scientist | PhD from NTU @ntu_spml @HungyiLee2 | ex- @Meta @GoogleDeepMind @AmazonScience | Speech LM, Full-Duplex Interaction
N. TONAMI @nori_seeker
257 Followers 450 Following Researcher. Sound Event Detection/Deep Learning/Acoustic Signal Processing/Fiber-Optic Sensing. ALL opinions are my own.
Neil Zeghidour @neilzegh
6K Followers 765 Following CEO @GradiumAI. Founder of @kyutai_labs. Invented neural codecs and audio LLMs. Prev. Google DeepMind/Brain, Meta, Toha Heavy Industries.
ロン @RonAIHARA
798 Followers 765 Following メーカーの研究者。博士(工学)。 「ブレーメン音楽団」とトーン・クラスター マンドリンオーケストラの指揮者。諏訪清陵/神戸大/音声・画像信号処理/マンドリンクラブ/クラシックギター/@bremen_mandolin/柴崎利文作品集/@Tonecluster2021
비비 @amdhfirkxbdlxj
84 Followers 194 Following
Yassine El Kheir @YassineElkheir
62 Followers 603 Following PhD Student at DFKI & Technical University of Berlin
Ryuto Kitajima / F Ve... @kitanmk
220 Followers 575 Following 九大 M1 システム情報科学府 情報理工学専攻 / @Fventures_jp
Robin Scheibler @fakufakurevenge
888 Followers 938 Following Grower of cucumbers 🥒, tomatoes 🍅, and chilli peppers 🌶️. I ❤ audio, microphone arrays, IoT, Python, and data.
千葉貴史|7.15-1... @tkc_0205
4K Followers 2K Following コンタクトセンターAIXパートナー|人材営業Mgr ▶︎ SaaS営業~事業責任者~ AI-SaaS PdM/BizDev/FDE ▶︎ 営業コンサル ▶︎ Edge Works代表取締役|ENTJ(指揮官)
茶々丸|Claude ha... @vibecoder_japan
1K Followers 502 Following CAN AI 代表 営業出身&アラフォー 2025年4月からAI事業にフルベット AIOps支援 | Claude_harness|Claude code|Google|Chatgpt
Naoyoshi Aikawa @ Rim... @awakia
5K Followers 1K Following AIで本当に仕事が変わる瞬間を現場から発信。 Google→Wantedly→AI議事録SaaS Rimoのエンジニア社長、創業7年目、社員27名、調達なし年2倍成長。 AIで売上を立て、非エンジニア含む全社員で使い倒してます。デモ映えで終わらない、実務で使える話だけします。 #ClaudeCode #AI経営
RecRecog @RecRecog
15 Followers 215 Following
Seita Miyamoto🌮 @Seita_Hareta
1K Followers 2K Following 株式会社tacoms代表取締役 | 飲食業界向けAll-in-One AI Platform「Camelシリーズ ( https://t.co/pEnUjDbaUV )」 を開発しています🐪🐪 全職種全力採用中です!採用情報こちら→https://t.co/0LoVGqvEXX
Yusuf Yiğit @yigityu
90 Followers 455 Following
RYAIRYAI @RYAIRYAI281974
0 Followers 48 Following
=^-.-^= @AVeryCuriousCat
10 Followers 910 Following
yoshi @s_ayu132
117 Followers 794 Following ゲーム制作 / Sony Group → 独立/ コミュニケーションロボット/ LOVOT「ぴーすけ」を迎えてはや3年 / Tweets are my own.
Yuji NARAKI @yuji_research
486 Followers 872 Following Main: ML Engineer, Sub: Researching ML & NLP. The banana keeps spinning every day. Do you keep making progress?
Paavo @PaavoParmas
101 Followers 296 Following Machine learning and reinforcement learning researcher. エストニア人. Currently @UTokyo_News Prev: @Cambridge_Uni, @OISTedu, @GoogleDeepMind intern, @RIKEN_AIP intern
Akash Mahajan @akashmjn
641 Followers 824 Following now 🎧; prev chatting with PDFs @ContextualAI; transcription @Azure Speech; @Stanford @atherenergy @iitmadras
Nous Research @NousResearch
212K Followers 26 Following A bunch of nerds making progress toward open source AI https://t.co/vrD0aDJeto
ALIFE Conference 2026 @ALifeConf
4K Followers 935 Following The Official X Account of ALIFE Conferences. The 2026 Conference on Artificial Life - Waterloo (Canada), 17-21 August #ALIFE2026
alex zhang @a1zhang
35K Followers 929 Following phd student @mit_csail @nlp_mit, previously undergrad @princeton 🫵🏻 go participate in the @GPU_MODE kernel competitions!
Perplexity @perplexity_ai
493K Followers 76 Following Curiosity changes everything. Download our free app on iOS, Mac, Windows, and Android.
一般社団法人AI�... @airoa_jp
575 Followers 9 Following 一般社団法人AIロボット協会(AIRoA)は、AI及びロボティクスの為のオープンプラットフォームを創出しています。 English: @airoa_org
Robert Scoble @Scobleizer
587K Followers 51K Following San Francisco/Silicon Valley AI | Robots, holodecks, BCIs, analysis of new things | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future.
Riku Arakawa / 荒川... @rikky0611
3K Followers 2K Following 米国カーネギーメロン大学CS博士課程| ヒューマンコンピュータインタラクション (HCI) 研究 | English, Research → @hciphdstudent
Ruri Maekawa @seimy_wakuwaku
2K Followers 1K Following 東京大学/QueeenB. inc研究の自動化/3D model 🫀Georgia tech (Organoid)→UT(Spheroid)→UW(Gastruloid) Seattle 🇺🇸
Kentaro Seki / 関健... @trgkpc
3K Followers 2K Following 3rd-year doctoral student @ Univ. Tokyo Saru-lab (JSPS DC1) | audio-language model, speech synthesis, machine learning
Hirokatsu Kataoka | �... @HirokatuKataoka
6K Followers 453 Following Chief Scientist @ AIST Academic Visitor @ Oxford VGG PI @ cvpaper.challenge PI @ LIMIT.Lab 3D ResNet (Top 0.5% in CVPR) FDSL (ACCV Award / BMVC Finalist)
Carnegie Mellon Unive... @CarnegieMellon
83K Followers 2K Following United by curiosity and driven by passion, we reach across disciplines, forge new ground and deploy our expertise to make real change that benefits humankind.
Meituan @meituan
4K Followers 73 Following Meituan is the leading #ecommerce platform for services in #China. We help people eat better, live better.
ERNIE for Developers @ErnieforDevs
4K Followers 52 Following Official developer community for Baidu ERNIE, the LLM from @Baidu_Inc. Powered by @Paddlepaddle.👋New here! Discord: https://t.co/3p6zVRznLy
StepFun @StepFun_ai
11K Followers 156 Following Scale-up possibilities for everyone. OpenRouter: https://t.co/1yZfzkC2N0 HuggingFace: https://t.co/isMgbv7XSm
InclusionAI @TheInclusionAI
2K Followers 18 Following AI Lab @AntGroup, we envision AGI as humanity's shared milestone. Our Language Model @AntLingAGI and LLaDA, Embodied AI @robbyant_brain, OSS projects AReaL etc.
Kimi.ai @Kimi_Moonshot
179K Followers 136 Following Built by Moonshot AI to empower everyone to be superhuman. ⚡️API: https://t.co/XCrgjXAqMw @KimiProduct where we share cool use cases. @Kimidevs built for developers
Z.ai @Zai_org
88K Followers 258 Following The AI Lab behind GLM models, dedicated to inspiring the development of AGI to benefit humanity. https://t.co/7a5aSCUNcZ https://t.co/x14hb3klXm
Yasufumi Moriya @yasufumi_moriya
143 Followers 267 Following Speech Scientist at @tproworld | PhD from @AdaptCentre @DCU | MSc @EdinburghUni | speech recognition and information retrieval. Seal lover.
まっすー @ymas0315
2K Followers 2K Following
Taishi Nakashima (中... @_tai_shi
616 Followers 827 Following Assistant Professor at Tokyo Metropolitan University, 東京都立大学 システムデザイン学部 助教, 音源分離の研究してます🎛️
Yuka Ko @keiouok
2K Followers 2K Following Postdoc at KIT, Karlsruhe / Interest: Speech Translation, Simultaneous Translation / Prev: Osaka Pref. Univ. (Bachelor) → NAIST (Master, Doctor, Postdoc)
Junnosuke Kamohara @jnskkmhr1218
340 Followers 897 Following Georgia Tech Robotics PhD, Bipedal locomotion @GT_LIDAR
Shinnosuke Takamichi ... @forthshinji
5K Followers 394 Following Speech researcher / 音声研究者. https://t.co/f8hJL8R1Lm
Asst. Prof. Li Sheng ... @cs_lisheng
787 Followers 8K Following ◆Faculty (Science Tokyo + Kyoto Univ.) / Guest (RIKEN) ◆Spoken Language Processing ◆Welcome collaboration, discussion CV: https://t.co/naL0tJB3sI
Gradium @GradiumAI
4K Followers 1 Following The voice layer for modern apps and agents. Real-time, scalable voice APIs: TTS, STT, turn-taking & voice cloning. Devs: build → https://t.co/r5CdNClhI5
N. TONAMI @nori_seeker
257 Followers 450 Following Researcher. Sound Event Detection/Deep Learning/Acoustic Signal Processing/Fiber-Optic Sensing. ALL opinions are my own.
Guan-Ting (Daniel) Li... @GTL094144
405 Followers 705 Following Research Scientist | PhD from NTU @ntu_spml @HungyiLee2 | ex- @Meta @GoogleDeepMind @AmazonScience | Speech LM, Full-Duplex Interaction
Neil Zeghidour @neilzegh
6K Followers 765 Following CEO @GradiumAI. Founder of @kyutai_labs. Invented neural codecs and audio LLMs. Prev. Google DeepMind/Brain, Meta, Toha Heavy Industries.
ロン @RonAIHARA
798 Followers 765 Following メーカーの研究者。博士(工学)。 「ブレーメン音楽団」とトーン・クラスター マンドリンオーケストラの指揮者。諏訪清陵/神戸大/音声・画像信号処理/マンドリンクラブ/クラシックギター/@bremen_mandolin/柴崎利文作品集/@Tonecluster2021
Richard C. Suwandi @richardcsuwandi
1K Followers 989 Following PhD-ing @cuhksz. Founding committee @AIDDA_institute. Co-developed OpenEvolve, Kai @driaforall
ModelScope @ModelScope2022
9K Followers 144 Following Driving innovations with open communities. 💬 Join our Discord: https://t.co/4M9AHVh5qa
Kazuki Yamauchi @urukas4869
501 Followers 394 Following Ph.D. Student at the University of Tokyo / BOOST NAIS / Student Researcher at @GoogleDeepMind / Speech and Language Processing
LiveKit @livekit
10K Followers 22 Following Open source framework and cloud platform for building voice, video, and physical AI agents. https://t.co/OWLvFH82oN
Václav Volhejn @vvolhejn
750 Followers 154 Following Research at Kyutai by day, creative coding by night. YouTube channels: Polylog, Václav Volhejn.
Stefania Druga @Stefania_druga
13K Followers 9K Following Research Scientist @SakanaAILabs, Former Research Scientist @GoogleDeepMind AI & Multimodal LLM applications/ PhD @UW / alumni @mit @msft @Theteamatx
Naoyoshi Aikawa @ Rim... @awakia
5K Followers 1K Following AIで本当に仕事が変わる瞬間を現場から発信。 Google→Wantedly→AI議事録SaaS Rimoのエンジニア社長、創業7年目、社員27名、調達なし年2倍成長。 AIで売上を立て、非エンジニア含む全社員で使い倒してます。デモ映えで終わらない、実務で使える話だけします。 #ClaudeCode #AI経営
Robin Scheibler @fakufakurevenge
888 Followers 938 Following Grower of cucumbers 🥒, tomatoes 🍅, and chilli peppers 🌶️. I ❤ audio, microphone arrays, IoT, Python, and data.
Alkis Koudounas @AlkisKoudounas
242 Followers 632 Following Research @Sony | ex- @AmazonScience | PhD @PoliTOnews || Post-training SpeechLLMs | Trustworthy and Responsible AI
Yuto Abe @ReactYuto
240 Followers 613 Following Speech AI / Full-Duplex Dialogue Web Systems × MLOps Waseda Univ. (FSE) → Kobayashi & Ogawa Lab M2 Aspiring AI / ML Forward Deployed Engineer 和敬塾西寮/東海高⛳️🎾🎸
























