Ben Hitz @bonscotthoughts
ENCODE DCC * IGVF DACC (he/him) Joined July 2010-
Tweets1K
-
Followers121
-
Following407
-
Likes1K
@anshulkundaje @sinabooeshaghi no, colored pencils
@JD_Buenrostro Even snowflakes can be classified: its.caltech.edu/~atomic/snowcr…
@arjunrajlab Still it's clear that a "type" is an ensemble of states. Types should probably be mapped to known microscopic evidence; that is mostly how they are thought of anyway. It starts to get fuzzy when you consider sorting cells on surface markers.
@arjunrajlab This isn't a useful definition of state because it's not precise. How many tpm difference is a different state? It is not a useful definition for type because cells have multiple functions (and function itself is slippery) and they generally are not found in all combinations
@lpachter Maybe you shouldn't read articles from profit focused paper mills like ...checks notes... Nature
@danorlovsky7 What app are you using?
1/ One of the most common mistakes in bioinformatics is the one-off error. If you are not careful, you will make mistakes. Even experienced bioinformaticians are not aware of it and the mistake prevails in many bioinformatics tools. A thread 🧵
@Aella_Girl @snowanddrugs2 is this because Republican-coded laws can be avoided more easier if you just have the resources to do so? Which is why dem-coded laws I consider more egalitarian.
@akhivae How many nations did Franco's Spain invade?
@Aella_Girl actual geneticists study the specific effect of human genetic variation on a huge variety of traits, diseases and phenotypes; see. e.g.: ebi.ac.uk/gwas/
@DrChrisCombs Honestly thought this was an orange cat wearing a very fancy drone hat
@GBBranstetter That's not fair. We aren't all tranq addicts
@pranamanam Aren't all these methods essentially bad? Better than random sure, but...
One funny story about this: I spent hours creating a figure in my book explaining 0 versus 1-based indexing and closed versus right-open intervals. The illustrator thought I made a careless error in starting one from 0 and the other from 1, and changed them to match 😱
@vsbuffalo @jgschraiber One of the many terrifying things I learned from your Bioinformatics Data Skills book!
Dear big consortia, It is never too late to be brave and use all that visibility you have to make a strong statement that academia will not be held hostage by glam journals & their shiny JIFs 1/
@zenahitz @lastpositivist what am I your google monkey? en.wikipedia.org/wiki/One_Crew_…
@Caroline_Bartma SPI1 actually official gene name.
Our paper "A machine readable specification for genomics assays" is now published in Bioinformatics, @OUPBioinfo. In short, we present a lightweight file format and command-line tool to document the structure of sequencing reads. Coauthored with @XiChenUoM and @lpachter. Paper: doi.org/10.1093/bioinf… Code: github.com/pachterlab/seq… What is in my sequencing reads? Sequencing machines produce text files, called FASTQs, that contain reads or sequences of DNA molecules. Assay developers and data generators deeply understand the contents of their reads; they know the location and presence biological and synthetic constructs like cellular barcodes. Collaborators, reproducers, and other scientists may not despite their sometimes obscure addition to supplementary material. Take for example the @10xGenomics Multiome assay. The 10x Genomics documentation [1] spells out the read structure for each modality: RNA reads contain a synthetic 16bp barcode, a 12 bp "randomly" generated unique molecular barcode (UMI), as well as cDNA that was captured via polyA capture. The ATAC reads consist of genomic DNA and a 16bp cellular barcode. However the 10x Genomics website explains that the ATAC portion of the 10x Multiome data contains an little-known 8bp constant sequence spacer that proceeds the 16bp cell barcode. So saying that you have "10x Multiome" reads is a necessary but not sufficient condition to know the contents of your FASTQ reads. The reason is because the FASTQ read structure is dependent on both the assay as well as the sequencing machine/recipe used; a sequencing library produced from one single-cell assay can yield different read structures depending these parameters. Take the the 10x Multiome assay. The ATAC 24bp barcode + spacer is usually sequenced as the i5 index read. Since the NextSeq 500/550 does not support a 24bp i5 read, the user must specify "dark cycles" (10x details the impact of this [2]) to skip the 8bp spacer. This yields a 16bp cell barcode in the i5 FASTQ file. If, however, the 10x Multiome ATAC library was not sequenced with dark cycles then the i5 FASTQ file will contain a 24bp spacer + barcode. I was originally unaware of the 8bp spacer and the use of dark cycles in Multiome library sequencing. But as I was recently looking ATAC reads I realized the impact it had on my count matrices; I had been extracting half of the cell barcode and all of the spacer. This meant I was performing barcode error correction and was UMI collapsing the, mostly similar, cell barcodes to produce few counts. This decoupling of read structure between the sequencing machine and the assay places a high priority on documenting read structure in a sequencer and assay-specific manner so that preprocessing tools can accurately extract and process relevant sequenced elements. A machine-readable specification I was inspired by @XiChenUoM 's efforts (which started while in @teichlab) in documenting sequencing reads of assays and I came up with an idea to document read structure in a machine- and human-readable specification. The specification is called seqspec [3]. The specification details the structure of a YAML file that allows users to specify and annotate the types of sequences that are contained in their FASTQ data. seqspec uses a nested representation of "Regions" and "Reads" that allows users to annotate groups of sequenced elements and map sequencing reads to sequencing primers. This enables, for example, all of the elements contained in Read 1 of a FASTQ file, such as the barcode and UMI in the 10x RNA assay to be annotated as belonging to Read 1. The spec also comes with an accompanying seqspec command line tool which gives users who annotate their sequencing assays many benefits: 1. Reproducibility and verifiability of the assay structure 2. Positional extraction of relevant features 3. Visualization of the sequencing structure The seqspec command line tool makes it straightforward to extract the positional index of sequenced elements. The barcodes in the 10x Multiome dataset could have easily been identified as starting 8bp into the reads with the seqspec index command. The tool also also makes it straightforward to visualize the structure of your sequencing reads. seqspec print can produce publication-ready figure of your read structure. Most importantly, seqspec makes it easy for others to reanalyze data for which a seqspec exists, bringing about verifiability of analysis results. seqspec adoption seqspec aims to make genomics processing correct and reproducible. seqspec was recently adopted as the first standard in the @IGVFConsortium and we anticipate the publication of terabytes of sequencing data alongside their seqspec read annotations. I personally believe seqspec will be transformative for reproducibility and analysis efforts, in particular for those undertaken by consortia. I hope that public databases (like the @NCBI SRA/GEO and DDBJ) will test out seqspec and look to adopt it as a standard for data submission. seqspec is freely available, open source, open to contributions, useable, and well documented. Please take a look at the GitHub repo and try it out! We welcome feedback. [1] 10xgenomics.com/support/single… [2] kb.10xgenomics.com/hc/en-us/artic… [3] github.com/pachterlab/seq…
@RyanMarino As someone with a colonoscopy scheduled Friday I wonder the same thing
Ezequiel Fisher @ezequiel_f25702
89 Followers 3K Following
Loxurez @LoxurezKEesG
19 Followers 716 Following
Idan Gabdank @gabdank
24 Followers 31 Following
Daphne Kontogiorgos-H... @DaphneKHeintz
84 Followers 516 Following CS PhD student at @UW working on ML for single-molecule protein sequencing with @datamolecules | Prev @Penn BE and CIS
sina @sinabooeshaghi
4K Followers 1K Following bio/ai postdoc at uc berkeley. prev. phd from caltech and math + me from mit.
Rintsen N. Sherpa @RintsenSherpa
15 Followers 61 Following Bioinformatics PhD Student @theboylelab @UMich
Pinepine_US_ @PinepineU280
0 Followers 99 Following
Hadley @lykemeski51990
29 Followers 462 Following
prashant kuntala @prashkuntala
154 Followers 1K Following Computational Biology PhD Candidate in Ting Wang's lab @twang5 @WUSTLdbbs. Studying epigenetics of Exceptional Longevity & OMICs Data Visualization
Kushal K Dey @kanishkadey
812 Followers 516 Following Assistant Member/Professor, Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center
Max Libbrecht @maxlibbrecht
222 Followers 184 Following
Prachee Avasthi @PracheeAC
26K Followers 8K Following Scientist. Co-founder & CSO @ArcadiaScience. Head of Open Science @AsteraInstitute. Chasing unknown unknowns.
Erez Lieberman Aiden @erezaterez
4K Followers 1K Following American, Jewish, Scientist. Opinions are my own.
Sasha Gusev @SashaGusevPosts
22K Followers 3K Following Statistical geneticist | Associate Prof at @DanaFarber / @harvardmed / @DFCIPopSci | Blogging at https://t.co/4D7UObBNdd
Kathleen Campbell @KathleenvyCampb
22 Followers 1K Following Gathered on the site of girls from all US states 😚 Ready for private meetings Some have naked profile photos! Watching this https://t.co/wS4BkqEjvU
Michael Milton @multimeric
327 Followers 287 Following Research Computing Engineer at @WEHI_research. #Rstats, #Python, #Rustlang, #javascript, #Bioinformatics. More active on Mastodon: https://t.co/UNeZnznJmY
Connor, UC grad stude... @Connor_T_G
348 Followers 830 Following PhD candidate @UCB_Biophysics. Meyer Lab. Dosage Compensation and the 3D genome. Views my own. @uaw_4811 member.
Tejaswini Mishra @tej... @tejaswini
634 Followers 1K Following Scientist enjoying bioinformatics, precision health, longitudinal data, multi-omics profiling, wearables and space biology @Stanford Genetics :) She|her
Lior Pachter @lpachter
67K Followers 2K Following Bren Professor of Computational Biology @caltech. Blog at https://t.co/FFQzhEsmhi. Tweets represent my views, not my employer's. #methodsmatter
Mark Gerstein @MarkGerstein
8K Followers 880 Following #Bioinformatics professor, interested in #DataScience, with photography, triathlons & non-fiction books as hobbies
rozowsky @rozowsky
85 Followers 302 Following Fun loving scientist, computational biologist, Yale research scientist
Gamze Gürsoy @gamzeandgursoy
2K Followers 816 Following Department of Applied Math and Theoretical Physics, University of Cambridge
Dave ❄️🗿 @rubrogram
576 Followers 983 Following Biochemist, biophysicist, 666 star general at the 77th, he/him.
Jacob Schreiber @jmschreiber91
5K Followers 1K Following Programmable Genomics Lab @UMassGCB, Technical Steering Committee @NumFOCUS. Prev @impvienna @StanfordMed. Studying genomics, ML, and fruit. Opinions my own.
Kaili Fan @Kaili_Fan
172 Followers 272 Following Bioinformatician & computational biologist. Studying gene regulation | Postdoc in @JD_Buenrostro lab @harvard | PhD alumni of @ZhipingWeng lab @UMassChan
Chris Mungall @chrismungall
1K Followers 1K Following https://t.co/64z6EJj8LT https://t.co/0dPY8W0e5W https://t.co/3Bw5ITI2QX https://t.co/0K7OeZkppP
Brenton Graveley @graveley
1K Followers 2K Following scientist, husband, father, potter, Great Dane addict. This account is personal opinions. My professional/scientific account is @Graveleylab #istandwithfauci
Shannon Jaronik @zonk... @zonk84
110 Followers 182 Following Flying the W, Singing Spanish Techno, Upwards not Backwards, Forward not Upwards, and always Twirling, Twirling... #GoCats
Trirupa Chakraborty @Bcell_Baddie
68 Followers 168 Following PhD Student at Jishnu Das Lab, University of Pittsburgh | NHGRI IGVF consortium | Systems Immunology | Transplant | Autoimmunity
Charalampos (Harris) ... @chlazaris
333 Followers 855 Following Senior Expert I, Oncology Data Science, Novartis Biomedical Research @NovartisCancer @NovartisUS. Drugging transcription. Views are my own.
Tim Triche, Jr. (@tim... @timtriche
3K Followers 1K Following Statistician, bioinfomortician, student of🩸. Same guy as on the blue sky place except that I actually post over there. Semi-supervised by https://t.co/K2reDzLZJH
Sean Maden @MadenSean
668 Followers 4K Following Sean is a senior-level bioinformatician (he/him; views mine) #analytics #programming #python #bash #shiny #workflows #etl #genomics #rnaseq #epigeneti#verified
Swapnil Keshari @SwapnilKeshari7
19 Followers 80 Following Challenging Status Quo | Computational Biologist by work, Wanderer by passion, Designer by heart
MrKrabs @crabrangoon93
27 Followers 692 Following
Mr. Swag @MrSwagPicks
72K Followers 72K Following Sports Handicapper who has a proven and documented track record. Handicapping NFL, NCAAF, MLB, NHL, NBA and NCAAB. Profiting in Sports-Betting since 2010
Michael Love @mikelove
16K Followers 4K Following Associate Professor of Genetics and Biostatistics at UNC
jd liechty (@jdliecht... @jdliechty
111 Followers 1K Following I do stuff with computers and biology. #DCStatehood
Nikhil Reddy Poddutur... @nikhil_r_p
13 Followers 140 Following External innovation within Big Pharma
Michael 英泉 Eisen @mbeisen
73K Followers 13K Following I science & try to make science open. Tweets do not represent my employer's views, but are always correct. My conflicts of interest https://t.co/77P0LDe6VR.
Josh Welch @LabWelch
2K Followers 547 Following Computational Biologist, Associate Professor at U. Michigan
Henry Pratt @hpratt111
40 Followers 144 Following MD/PhD student in bioinformatics and computational biology, @umassmed_zlab
lost grad student, ph... @Drawrof1
5K Followers 4K Following my data runs wild and free as i have yet to gather it
Fahri Küçükali @maegsul
915 Followers 4K Following Postdoc at @SleegersLab @CMN_VIB working on complex genetics of Alzheimer's. | GWAS, molQTLs, TWAS, splicing, long reads, rare variants. https://t.co/pO7ClZPMdI🦋
akshay Balsubramani @_bakshay
768 Followers 635 Following 💻ML/AI |🧬🧪| 📈 • Biotech industry • Ex @StanfordMed, PhD @UCSD, big tech &c • 🇸🇬🇺🇸🇮🇳 • Caveat: humor
Idan Gabdank @gabdank
24 Followers 31 Following
Daphne Kontogiorgos-H... @DaphneKHeintz
84 Followers 516 Following CS PhD student at @UW working on ML for single-molecule protein sequencing with @datamolecules | Prev @Penn BE and CIS
Jason Koebler @jason_koebler
37K Followers 2K Following Journalist @404mediaco, former EIC @motherboard Signal: jason.404 // email: [email protected]
Barbara Engelhardt @BeEngelhardt
10K Followers 995 Following Machine learning, statistical genomics, ML4 health, senior investigator at Gladstone Institutes, professor at Stanford, single mom to four kids. she/her
Aella @Aella_Girl
250K Followers 409 Following whorelord, survey artist, sample size queen, way too earnest. https://t.co/IcEgPhWaSW
sina @sinabooeshaghi
4K Followers 1K Following bio/ai postdoc at uc berkeley. prev. phd from caltech and math + me from mit.
Jadson Jall, PhD @JadsonJall
5K Followers 243 Following PhD in Genetics | Community Builder | Collaborative Leadership Consultant for STEM Leaders and Organizations at the BridgUs Lab
Rintsen N. Sherpa @RintsenSherpa
15 Followers 61 Following Bioinformatics PhD Student @theboylelab @UMich
Kyle Juszczyk @JuiceCheck44
155K Followers 843 Following Harvard Grad. San Francisco 49er Fullback. Pursue Victory with Honor!
(((David Nieporent))) @dnieporent
485 Followers 365 Following Orioles fan, Attorney, libertarian (but politically homeless), NeverTrump. 🇺🇦. 🇮🇱. Full of passionate intensity. Also on the other app. 🦋 Abolish ICE.
Alexis Battle @alexisjbattle
5K Followers 432 Following Professor of Biomedical Engineering at JHU. Director, Malone Center for Engineering Healthcare. Interim co-director, Data Science and AI Inst. ML/AI, genomics
Danielle Langlois @DanielleLangWa
8K Followers 7K Following Thrilling, smart stories with heart: tribalism, corruption, the unexplained. Call your elected reps! #RankedChoiceVoting 🇺🇸 #DaniellesSocialMediaDisclaimer
Joshua Elkington @elkingtonxy
31K Followers 128 Following Founder and General Partner at Axial @axialvc
PFF SF 49ers @PFF_49ers
33K Followers 189 Following #49ers grades, statistics and analysis from PFF | @PFF -- [email protected]
Gary Bader @garybader1
2K Followers 68 Following Professor, University of Toronto https://t.co/oJeDcaLoGD https://t.co/lcSStJrc0M
prashant kuntala @prashkuntala
154 Followers 1K Following Computational Biology PhD Candidate in Ting Wang's lab @twang5 @WUSTLdbbs. Studying epigenetics of Exceptional Longevity & OMICs Data Visualization
Honest NFL @TheHonestNFL
74K Followers 5K Following Football theorist. Private consultant focusing on scheme, process, pedagogy. Vic Fangio enthusiast. Andy Reid scholar. CashApp/Venmo linked to page.
Prakash @8teAPi
56K Followers 5K Following FOLLOW ME for AI commentary; tech optimist, future shocked self-aware neuron, once fooled by superconductors;
Kushal K Dey @kanishkadey
812 Followers 516 Following Assistant Member/Professor, Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center
Max Libbrecht @maxlibbrecht
222 Followers 184 Following
Nomadic Warriors for ... @Nomads4Pritzker
18K Followers 1K Following Posting live from the Most Serene IL-Khanate of the Great Khan JB Pritzker
Ming "Tommy" Tang @tangming2005
46K Followers 3K Following Director of bioinformatics at AstraZeneca. YouTube at chatomics. On my way to helping 1 million people learn bioinformatics. Also talks about leadership.
Howard Chang @HowardYChang
6K Followers 69 Following
Xiaole Shirley Liu @XShirleyLiu
10K Followers 68 Following Computational biologist & CEO @ GV20 Therapeutics: AI-based antibody discovery and drug development, novel IO checkpoint inhibitor anti-IGSF8 in phase 2.
Shane Campbell-Staton... @SCampbellstaton
5K Followers 211 Following Evolutionary biologist | Associate Professor @Princeton | TV host, narrator, podcaster | Human Footprint and Evolution Earth now screening on @PBS
Anne Carpenter, PhD @DrAnneCarpenter
23K Followers 2K Following Discovery in images. Enthralled by science. SAB Recursion, Quiver. Building SyzOnc. Opinions my own. https://t.co/Q7nekPJPBf
Elisabeth Bik @MicrobiomDigest
144K Followers 44K Following Science integrity consultant, PhD. #ImageForensics @METRICStanford affiliate / https://t.co/2xH3WdXTFr / https://t.co/V4MR7WV9tl 🇳🇱🇺🇸
Jay Shendure @JShendure
14K Followers 320 Following Genomics technology developer. Developmental biology adult learner. Professor at @uwgenome @HHMINEWS @BrotmanBaty. He/him.
dana_peer @dana_peer
15K Followers 166 Following
Ngaio Bealum @ngaio420
24K Followers 6K Following The Dank Diplomat. Cannabis expert on #cookingonhigh. Comedian, writer, juggler, musician, parent, activist, great cook. Hard beats and soft drugs. Oon-Guy-Yo.
sanjana curtis is now... @sanjanacurtis
50K Followers 2K Following astrophysicist | @sanjanacurtis.bsky.social
Veera Rajagopal @doctorveera
21K Followers 5K Following 🇮🇳 MBBS, MD, 🇩🇰 PhD | 🧬 Co-founder & CSO @wellytics Building genetics-driven drug discovery in India Thoughts: https://t.co/QKUXWuTGfr
Gene Ontology @go@gen... @news4go
3K Followers 220 Following The Gene Ontology project provides a controlled vocabulary to describe gene and gene product attributes in any organism. Not active on Twitter, check 🦋 & 🐘
Prachee Avasthi @PracheeAC
26K Followers 8K Following Scientist. Co-founder & CSO @ArcadiaScience. Head of Open Science @AsteraInstitute. Chasing unknown unknowns.
Jonah Cool @JCoolScience
2K Followers 528 Following Cell biologist, geneticist and fascinated by all things science 🦠 🔬 . @CZIScience program officer leading single cell tech & supporting @humancellatlas
Erez Lieberman Aiden @erezaterez
4K Followers 1K Following American, Jewish, Scientist. Opinions are my own.
Lappalainen Lab @tuuliel_lab
5K Followers 578 Following Functional human genomics @ KTH, SciLifeLab, NY Genome Center. Pro science here, more fun stuff fun at @tuuliel. Mastodon [email protected]
Sasha Gusev @SashaGusevPosts
22K Followers 3K Following Statistical geneticist | Associate Prof at @DanaFarber / @harvardmed / @DFCIPopSci | Blogging at https://t.co/4D7UObBNdd
Chris Chambers @chrisdc77
21K Followers 2K Following Reproducibility and transparency of science @cardiffuni. Author: 7 Deadly Sins of Psychology. Pater trium, dominus nullius
Michael Milton @multimeric
327 Followers 287 Following Research Computing Engineer at @WEHI_research. #Rstats, #Python, #Rustlang, #javascript, #Bioinformatics. More active on Mastodon: https://t.co/UNeZnznJmY
biohub @biohub
29K Followers 601 Following Combining frontier AI & frontier biology to help scientists cure or prevent disease
Stefano Mangiola @steman_research
971 Followers 158 Following Computational cancer-immunology and tumour microenvironment @SAiGENCI (co-appointment @WEHI_research), VCA fellow @VicGovDH
Connor, UC grad stude... @Connor_T_G
348 Followers 830 Following PhD candidate @UCB_Biophysics. Meyer Lab. Dosage Compensation and the 3D genome. Views my own. @uaw_4811 member.













