Alessandro Ferrari @BioAlessandro

MS@Harvard, BS@GaTech, Until Labs | 8VC fellow | prev @ BigHat Bio, Maverick Metals (YC S22), Biocentis | alessandroferrari.live | SF & NYC | Joined October 2023

Tweets: 259 | Followers: 286 | Following: 584 | Likes: 6K

Alessandro Ferrari @BioAlessandro

a week ago

I asked Fable 5 to work on your frontier AI research and it had no problem at all dude…

0 0 0 48 0

Alessandro Ferrari @BioAlessandro

2 weeks ago

When Claude is wrong and you correct it but thankfully it’s only wrong on things you’re very knowledgeable about, not on things you only have surface knowledge of.

0 0 1 52 0

Alessandro Ferrari @BioAlessandro

a month ago

@GaddipatiHarsha How about making a product that will receive upvotes? That feels like a better use of YC resources😂

1 0 3 179 0

Alessandro Ferrari @BioAlessandro

2 months ago

New X banner

NASA's Kennedy Space Center @NASAKennedy

2 months ago

The planet can spell your name – literally. 🔤🌍 This Earth Day, see your name written in landscapes captured by Landsat: go.nasa.gov/4ak4Cdu

2K 28K 184K 56.6M 77K

0 0 3 254 0

Alessandro Ferrari @BioAlessandro

2 months ago

@javi_2326 @andrewgwils Oh don’t worry, I’m not a math or physics major. Unfortunate reality for me as well.

0 0 0 75 0

Alessandro Ferrari @BioAlessandro

2 months ago

@viennaCtrl_ @andrewgwils Everything is math

0 0 0 76 0

Alessandro Ferrari @BioAlessandro

2 months ago

@predict_addict @andrewgwils Because I’m a cs major and consciously chose not to do math and physics knowing it would have probably been better. Hindsight 20/20

1 0 1 96 0

Alessandro Ferrari @BioAlessandro

2 months ago

@adityaxprasad “Ex sweatshop worker, able to pull 20 hour days”

0 0 2 90 0

Alessandro Ferrari @BioAlessandro

2 months ago

@adityaxprasad Biohacking

0 0 0 40 0

Alessandro Ferrari @BioAlessandro

2 months ago

To settle the "buy GPUs vs rent" debate for side projects: once you buy that RTX PRO 6000, you will procrastinate and let it idle. If you're renting it for $2/hr, you will be more productive working on your side project than you will at your day job.

0 0 3 418 0

Alessandro Ferrari @BioAlessandro

2 months ago

@datavorous_ An agent orchestration harness with a CEO, engineers, and validator agents enabling devs to push 150kloc/day? Seems like the right next step if this kid wants to stop shipping weekend side projects and make a real impact on the world!

0 0 1 126 0

Alessandro Ferrari @BioAlessandro

2 months ago

@quantian1 Yea, the same way everyone can trivially set up FTP with a CVS system on top and recreate Dropbox

1 1 18 10K 1

Alessandro Ferrari @BioAlessandro

2 months ago

PrisML team iteratively adding bits every couple of weeks until they land on bf16 from first principles

Sahin Lale @SahinLale

2 months ago

Turns out adding 0 helps :) Today we’re introducing Ternary Bonsai 🌳, a family of end-to-end 1.58-bit language models in 8B, 4B, and 1.7B sizes. Ternary Bonsai 8B is within 5% of Qwen 3 8B at 9x lower memory. Still tiny. Noticeably smarter

9 13 200 18K 56

0 0 3 392 0

Alessandro Ferrari @BioAlessandro

2 months ago

The european mind (me) cannot comprehend that you would see an $18 flight ticket and your first instinct is to buy all of them for $3400. Well played, well played

Alex Kehr @alexkehr

2 months ago

the american mind (me) cannot comprehend european airline flight prices can i just book all 190 seats for $3400 and have a private 737 flight?

602 292 37K 12.6M 1K

0 0 5 369 0

Alessandro Ferrari @BioAlessandro

2 months ago

I respect the A/B testing

Jessica Paquette @barrelshifter

2 months ago

corrupting the youth by having them fix random llvm bugs

4 11 126 4K 5

0 0 2 211 0

Justin Xia @justinqxia

2 months ago

Renting H100s from runpod to write tinygrad bounties like a medieval peasant paying a tithe to his feudal lord for a meager plot of compute. I toil day and night, hoping my bounty harvest is enough to win the respect of the king and avoid starvation

2 2 15 656 1

Alessandro Ferrari @BioAlessandro

2 months ago

@itselouardi 😂have to keep it for intellectual honesty. I fucked up

1 0 4 64 0

Alessandro Ferrari @BioAlessandro

2 months ago

It's nice that we could get Bonsai-family support so quickly, but this is a bit disingenuous. I have never contributed to tinygrad, so I am not in a position to critique this; however, this implementation unpacks the 1-bit weights to float16 and runs computations in float16 instead of running custom kernels on the packed weights, nullifying a lot of the benefits of the Bonsai architecture. Q1_0 is a packed 1-bit format: for each block of 128 weights, you store 16 bytes of bits and 2 bytes for a shared fp16 scale, so 128 weights take 18 bytes total. If you unpack those same 128 weights into float16, that becomes 256 bytes (about 14x). This is basically unpacking the "bit-based LLM" into normal float16 and running calculations that way. My understanding is that llama.cpp’s Bonsai support keeps the weights in the quantized Q1_0 representation and uses kernels that operate on that packed format, which is the whole point. Again, I do not mean this as a shot at the implementation itself. Getting support working this quickly is genuinely cool. I might also be misunderstanding some parts of this, hopefully not too much, but would love to be corrected.

the tiny corp @tinygrad

2 months ago

Just merged an external PR for Bonsai-8B support (1 bit LLM). Because tinygrad has the correct abstractions, it was 5 lines. huggingface.co/prism-ml/Bonsa… github.com/tinygrad/tinyg…

7 12 256 36K 70

3 1 38 15K 18
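
The size arithmetic in the tweet above (16 bytes of bits plus a 2-byte fp16 scale per 128 weights, versus 256 bytes in float16) can be checked with a small sketch. This is only an illustration of the byte counts: the bit order, the sign convention, and the way the scale is chosen here are assumptions made for the example, not the actual Q1_0 layout used by llama.cpp or tinygrad, and `pack_block` / `unpack_block` are hypothetical helper names.

```python
import struct

BLOCK = 128  # weights per block, as stated in the tweet


def pack_block(weights):
    """Pack 128 floats into 16 bytes of sign bits plus a 2-byte fp16 scale.

    Bit order, sign convention, and scale choice are illustrative assumptions.
    """
    assert len(weights) == BLOCK
    scale = sum(abs(w) for w in weights) / BLOCK  # assumed: mean absolute value
    bits = bytearray(BLOCK // 8)                  # 128 bits -> 16 bytes
    for i, w in enumerate(weights):
        if w >= 0:
            bits[i // 8] |= 1 << (i % 8)          # 1 = positive, 0 = negative
    return bytes(bits) + struct.pack("<e", scale)  # "<e" = little-endian fp16


def unpack_block(blob):
    """Expand a packed block back into 128 floats (+scale or -scale)."""
    bits, (scale,) = blob[:16], struct.unpack("<e", blob[16:])
    return [scale if (bits[i // 8] >> (i % 8)) & 1 else -scale
            for i in range(BLOCK)]


weights = [(-1) ** i * 0.5 for i in range(BLOCK)]  # toy data: +0.5, -0.5, ...
packed = pack_block(weights)

packed_bytes = len(packed)         # 16 bytes of bits + 2 bytes of scale
fp16_bytes = BLOCK * 2             # 2 bytes per weight when stored as float16
ratio = fp16_bytes / packed_bytes  # roughly 14x
print(packed_bytes, fp16_bytes, round(ratio, 1))
```

The sketch covers storage only. Whether a framework ever materializes the full float16 tensor depends on how the unpacking is scheduled relative to the rest of the computation.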

Alessandro Ferrari @BioAlessandro

2 months ago

Yea, just replied as well, I was totally misunderstanding. For some reason I thought that the ggml loading to tensor wouldn't be fused with the rest of the code (not sure why I would think that) so the scheduler only saw the multiplication with the float16 d, and separately saw the rest of the model. The memory usage doesn't lie

0 0 1 216 0

Alessandro Ferrari @BioAlessandro

2 months ago

@__tinygrad__ Oh wow, yea absolutely. I mistakenly thought that the multiplication with d in the loader would result in a cast to float16, but it gets fused and never directly unpacked. This library is beautiful