<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Sequence & Destroy by Evan Peikon]]></title><description><![CDATA[Sequence & Destroy by Evan Peikon]]></description><link>https://sequenceanddestroy.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!_2A6!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F761851cd-6851-47e8-a60e-9f3312b7a42c_618x618.png</url><title>Sequence &amp; Destroy by Evan Peikon</title><link>https://sequenceanddestroy.substack.com</link></image><generator>Substack</generator><lastBuildDate>Sun, 31 May 2026 16:10:45 GMT</lastBuildDate><atom:link href="https://sequenceanddestroy.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Evan Peikon]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[decodingbiology@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[decodingbiology@substack.com]]></itunes:email><itunes:name><![CDATA[Evan Peikon]]></itunes:name></itunes:owner><itunes:author><![CDATA[Evan Peikon]]></itunes:author><googleplay:owner><![CDATA[decodingbiology@substack.com]]></googleplay:owner><googleplay:email><![CDATA[decodingbiology@substack.com]]></googleplay:email><googleplay:author><![CDATA[Evan Peikon]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Issue #76// What LLMs Get Wrong About Multiple Testing Correction]]></title><description><![CDATA[A Guide to Multiple Testing Correction for Computational Biologists.]]></description><link>https://sequenceanddestroy.substack.com/p/issue-76-what-llms-get-wrong-about</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-76-what-llms-get-wrong-about</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 17 May 2026 10:01:39 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/c0e8ea27-1a21-4c4b-b9fc-a48fccf26d0e_1384x814.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3C-9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3C-9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 424w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 848w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1272w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png" width="1456" height="308" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:308,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:277062,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196584918?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!3C-9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 424w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 848w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1272w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><h2>Issue &#8470; 76 // What LLMs Get Wrong About Multiple Testing Correction</h2><p>In <a href="https://sequenceanddestroy.substack.com/p/issue-73-the-orchestrators-edge">The Orchestrator&#8217;s Edge</a> I wrote:</p><blockquote><p>When you&#8217;re no longer capable of tracing every line of AI-generated code back to your own reasoning, your ability to read, interrogate, and sanity-check what a machine produces becomes the last line of defense against silent compounding errors, which ultimately determines the validity of downstream scientific results.</p></blockquote><p>The word <em>downstream</em> is key here. The most consequential errors in a computational biologist&#8217;s analysis pipeline aren&#8217;t the ones that crash their script. They&#8217;re the ones that embed incorrect methodologies into an analysis without raising red flags.  </p><p>Consider something as routine as differential expression analysis on proteomic data. If you ask an LLM to write the code for you, it will. But what decisions did the model make along the way? Did it opt for a t-test when the data&#8217;s non-normal distribution called for a Wilcoxon rank-sum test? Did it apply a Perseus-style filter you didn&#8217;t want? And when it corrected for multiple comparisons, did it choose Bonferroni or Benjamini-Hochberg?</p><p>The computational biologist who can spot wrong turns&#8212;or better yet, who has enough methodological intuition to write a prompt that forecloses bad choices before they happen&#8212;is the one whose science will hold up best in the long run. There&#8217;s no shortcut to building this skill; you really do need to learn the ins and outs of your methods and when certain contexts elevate one over another. In our world that demands instant results, there&#8217;s the temptation to try and hack this process by asking AI to bang out some code using &#8220;gold-standard&#8221; analysis methods, bypassing the need to write detailed and informed prompts. Unfortunately, this isn&#8217;t a winning strategy.</p><p>In today&#8217;s piece, I&#8217;m going to focus on one consequential choice that comes up constantly when analyzing multiomic data: whether to use the Benjamini-Hochberg (BH) or Bonferroni procedure for multiple testing correction. Ask an LLM to write a differential expression analysis pipeline and it will almost always default to Benjamini-Hochberg. This almost certainly reflects the composition of its training data rather than a principled methodological choice&#8212;BH dominates the published bioinformatics literature, so models trained on that literature will reproduce the pattern. Nine times out of ten, that default is actually correct. But when it isn&#8217;t, the mistake can carry serious clinical repercussions. After reading this piece you&#8217;ll understand why, and how you can make more informed decisions in your own work.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Both the Benjamini-Hochberg method and Bonferroni correction are multiple testing procedures used to reduce the likelihood of type I errors (i.e., false positives), which is the risk of saying a finding significant when it is not. However, each method asks a slightly different question, and therefore involves a different tradeoff. Benjamini-Hochberg asks &#8220;<em>what fraction of my hits are false positives, and is that fraction acceptable?&#8221;</em> Bonferroni by contrast asks &#8220;<em>am I certain that none of these results are false positives?&#8221;</em> Keeping this distinction in mind will help you understand when one method is more appropriate than the other. But before we get into those differences, it&#8217;s worth discussing how running many statistical tests increases the risk of type I errors in the first place.</p><p>Suppose we want to run a Wilcoxon rank-sum test to determine whether the expression of AKT T308 differs between treatment responders and non-responders in triple negative breast cancer. Before running the analysis, we set an alpha value (&#945;), which is the threshold used to determine whether our result is statistically significant. A common alpha value is 0.05, meaning that if our p-value is less than 0.05, we consider the result significant.</p><p>Any time we run a statistical test, the risk of a type I error equals alpha. So, in the example above, the probability that we incorrectly conclude AKT T308 expression differs between groups is 5%. Now consider that we&#8217;re rarely testing just one protein. Often we&#8217;ll compare the abundances of hundreds of proteins, if not more. When we run that many tests, the probability that at least one result is a false positive grows quickly, and can be quantified with the family-wise error rate formula 1&#8722;(1&#8722;&#945;)&#8319;. Using this formula, we can see that with 1, 5, and 20 tests, the risk of at least one false positive is 5%, 23%, and 64%, respectively. By around 100 tests, the probability of at least one false positive approaches 100%, as you can see in the figure below. If we&#8217;re comparing hundreds of proteins or thousands of genes, false positives are essentially guaranteed.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FFPf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FFPf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 424w, https://substackcdn.com/image/fetch/$s_!FFPf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 848w, https://substackcdn.com/image/fetch/$s_!FFPf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 1272w, https://substackcdn.com/image/fetch/$s_!FFPf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FFPf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png" width="458" height="275.52988047808765" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:604,&quot;width&quot;:1004,&quot;resizeWidth&quot;:458,&quot;bytes&quot;:49792,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/197744132?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34cf5328-3599-419e-8dd4-89b29d1cc1e0_1004x604.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FFPf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 424w, https://substackcdn.com/image/fetch/$s_!FFPf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 848w, https://substackcdn.com/image/fetch/$s_!FFPf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 1272w, https://substackcdn.com/image/fetch/$s_!FFPf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F178e4bd0-7dbe-4fb7-9751-17557c763e02_1004x604.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The purpose of multiple testing correction is to reduce this risk of false positives as the number of tests increases. In practice, both the Benjamini-Hochberg and Bonferroni procedures accomplish this, but through different mechanisms and with different tradeoffs, as we&#8217;ll expand on below. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The Benjamini-Hochberg method controls the false discovery rate (FDR)&#8212;the expected proportion of false positives among all results we call significant. The procedure works as follows:</p><ol><li><p>Rank all individual p-values in ascending order p(1) &#8804; p(2) &#8804; &#8230; &#8804; p(<em>m</em>), where <em>m</em> in the number of statistical tests/comparisons we run. </p></li><li><p>Assign a rank <em>i</em> to to each ordered p-value, where <em>i</em>=1 is the smallest p-value and <em>i</em>=<em>m </em>is the largest. </p></li><li><p>Choose your acceptable FDR level (Q). A standard value is 0.05, meaning we tolerate up to 5% false positives among significant results.</p></li><li><p>Calculate the BH critical value corresponding to each p-value using the following formula: critical value = <em>i/m</em> * <em>Q. </em></p></li><li><p>Find the largest p-value that is less than or equal to its corresponding critical value. That p-value and all smaller ones are declared significant.</p></li></ol><p>As an example, suppose we&#8217;ve run 7 statistical tests and have the following ordered p-values: 0.01 &#8804; 0.02 &#8804; 0.04 &#8804; 0.048 &#8804; 0.05 &#8804; 0.057 &#8804; 0.06. Setting Q = 0.05, the BH critical values for ranks 1 through 7 are: 0.007, 0.014, 0.021, 0.029, 0.036, 0.043, 0.050. Comparing each p-value to its critical value, we find that none of the p-values are less than or equal to their corresponding critical value, so none of the results survive BH correction, despite several being below the naive p&lt;0.05 threshold.</p><p>Using this method, you can see how the more tests you do, the harder it is for any given p-value to be significant after multiple testing correction. The chart below shows p-value rank plotted against critical thresholds. When m=2, the first (smallest) ordered p-value being significant at p&lt;0.025, but as m increases we see this value getting smaller and smaller. By the time we reach m=6 we&#8217;re already at the point where p must be &lt;0.01 to be significant. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RM44!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RM44!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 424w, https://substackcdn.com/image/fetch/$s_!RM44!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 848w, https://substackcdn.com/image/fetch/$s_!RM44!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 1272w, https://substackcdn.com/image/fetch/$s_!RM44!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RM44!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png" width="596" height="286.26771653543307" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:610,&quot;width&quot;:1270,&quot;resizeWidth&quot;:596,&quot;bytes&quot;:163827,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/197744132?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F05c9d30e-a841-4b2c-babc-7a6e28ddaf10_1270x610.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RM44!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 424w, https://substackcdn.com/image/fetch/$s_!RM44!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 848w, https://substackcdn.com/image/fetch/$s_!RM44!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 1272w, https://substackcdn.com/image/fetch/$s_!RM44!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6f830efc-8e00-4443-a059-3cad9fdf9f12_1270x610.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The algorithm procedure above gives us a binary yes/no output for each tests&#8212;is it significant or not? The reason I introduced this first is that it helps build intuition for what BH correction is doing, but there is actually a simpler way to arrive at the same conclusion. One that produces a reportable adjusted p-value.  To perform this method, we first rank order our p-values as we did previously, where p(1) &#8804; p(2) &#8804; &#8230; &#8804; p(m) and m is the total number of tests. Then, for each of your p-values, use the following formula: </p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\text{Adjusted p-value} = \\text{Original p-value} * \\frac{m}{I}&quot;,&quot;id&quot;:&quot;AXTWHNKRDN&quot;}" data-component-name="LatexBlockToDOM"></div><p>where <em>i</em> is the ordered ran and <em>i</em>=1 is the smallest p-value. However, after computing the adjusted p-value for all original p&#8217;s, we then apply a step-down forcing rule from the largest to smallest value to ensure the adjusted sequence never decreases as rank decreases. To definition for this rule as follows:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;q_i=min(q_{I+1}, p_i*\\frac{m}{i})&quot;,&quot;id&quot;:&quot;BCSQRYRAEO&quot;}" data-component-name="LatexBlockToDOM"></div><p>where you begin by applying the method to the largest p-value (called the base case) and work backwards to the smallest value. Using the same raw p-values as above, we now get the following results: </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!v6kg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!v6kg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 424w, https://substackcdn.com/image/fetch/$s_!v6kg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 848w, https://substackcdn.com/image/fetch/$s_!v6kg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 1272w, https://substackcdn.com/image/fetch/$s_!v6kg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!v6kg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png" width="486" height="459.9219512195122" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1164,&quot;width&quot;:1230,&quot;resizeWidth&quot;:486,&quot;bytes&quot;:146773,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/197744132?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!v6kg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 424w, https://substackcdn.com/image/fetch/$s_!v6kg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 848w, https://substackcdn.com/image/fetch/$s_!v6kg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 1272w, https://substackcdn.com/image/fetch/$s_!v6kg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5b71d3f-218e-498f-86a1-8a5560658656_1230x1164.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When the adjusted p-value (aka q-value) is smaller than our original <em>&#945;,</em> then the adjusted p-value is significant. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The Bonferroni procedure controls the family-wise error rate (FWER)&#8212;the probability of making even a single false positive across all tests performed. Rather than tolerating a controlled fraction of false positives as BH does, Bonferroni aims to ensure that the entire set of conclusions contains no false positives at all<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. There are two ways to do this procedure, the first being the decision gate method, which scaled down the target significance (<em>&#945;)</em> value based on the number of tests. To do this, we use the formula:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;&#945;_{new} = \\frac{&#945;}{m}&quot;,&quot;id&quot;:&quot;JSHGOTLBWZ&quot;}" data-component-name="LatexBlockToDOM"></div><p>We then reject the null hypothesis for any test whose raw p-value is &#8804; &#945;_new. For example, with 20 tests and &#945; = 0.05, the corrected threshold drops to 0.0025. With 100 tests, it drops to 0.0005, as you can see demonstrated in the chart below. As the number of comparisons grows into the hundreds or thousands&#8212;as is routine in proteomics and genomics&#8212;only the very strongest signals will clear this threshold. This makes Bonferroni highly conservative: it substantially reduces the risk of any false positive, but at the cost of a higher type II error rate (i.e., missing real findings).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BAaF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BAaF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 424w, https://substackcdn.com/image/fetch/$s_!BAaF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 848w, https://substackcdn.com/image/fetch/$s_!BAaF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 1272w, https://substackcdn.com/image/fetch/$s_!BAaF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BAaF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png" width="551" height="336.4802371541502" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:618,&quot;width&quot;:1012,&quot;resizeWidth&quot;:551,&quot;bytes&quot;:49214,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/197744132?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2335e6dd-1f1c-4f27-9757-6a129e7c0bf2_1012x618.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BAaF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 424w, https://substackcdn.com/image/fetch/$s_!BAaF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 848w, https://substackcdn.com/image/fetch/$s_!BAaF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 1272w, https://substackcdn.com/image/fetch/$s_!BAaF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F025e94d3-ac16-4ab9-898d-7979419cae6c_1012x618.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Bonferroni correction can also be expressed as an adjusted p-value, which allows direct comparison to your original alpha. For each raw p-value, the Bonferroni-adjusted p-value is:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\text{Adjusted p-value} = min(1, p_i*m). &quot;,&quot;id&quot;:&quot;JGOGEHQOGM&quot;}" data-component-name="LatexBlockToDOM"></div><p>So, if your original p-value is 0.02 and m=10, our new p-value is 0.2. We can then compare this to alpha, where we get 0.2 &gt; 0.05, therefore the result is not significant. Working backwards, you&#8217;d need a raw p-value of 0.005 or smaller to achieve significance at &#945; = 0.05 with 10 tests&#8212;a demanding standard that grows more demanding with every additional comparison.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Whether you choose BH or Bonferroni comes down to what kind of error you&#8217;re more willing to tolerate. BH maximizes discovery&#8212;it finds more true positives, but accepts that some fraction of your significant results will be false. Bonferroni by contrast minimizes the chance of any false positive, but increases the risk of missing real effects.</p><p>For most exploratory bioinformatics analyses, BH is the appropriate choice. Proteomics and transcriptomics studies are typically hypothesis-generating, meaning their findings inform downstream validation efforts, not immediate clinical action. In this context, a false positive that gets filtered out in the next experiment is a manageable cost, and the greater statistical power of BH ensures you&#8217;re not missing real biology. There&#8217;s also an interpretive buffer built into pathway-level analyses: if AKT, mTOR, 4EBP1, and eIF4E all appear significantly elevated in TNBC non-responders, a single false positive among them doesn&#8217;t materially change the conclusion that the pathway is activated.</p><p>Our reasoning changes though when results directly inform clinical decision making. Imagine a clinical proteomics study identifying biomarkers to stratify patients onto different treatment arms in a cancer trial. A false positive in that context&#8212;a protein that appears to predict response but doesn&#8217;t&#8212;could result in patients receiving an ineffective or harmful treatment. Here, the acceptable false positive rate is not 5% of your hits; ideally it&#8217;s zero. Bonferroni correction, with its strict family-wise error rate control, is the appropriate tool when the downstream consequence of a single false positive is clinical harm rather than a failed replication.</p><p>Practically, we can use a simple heuristic to determine which test is appropriate: use BH for discovery-stage analyses where findings will be subject to further validation, and default to Bonferroni when results will directly inform clinical or regulatory decisions. This is the type of methodological judgment you don&#8217;t want an LLM to make for you. It doesn&#8217;t know whether your analysis is exploratory or consequential, or whether your hits will be validated in a follow-up experiment or used to stratify patients in a trial. When you prompt an AI to write your analysis pipeline, you need to supply that context and to do that, you need to understand the choice well enough to know it matters. That&#8217;s the irreducible core of what it means to be a computational biologist in the age of AI-assisted science.</p><div class="callout-block" data-callout="true"><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing or sharing it with a friend. I regularly shared hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><p></p></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>This reflects a  tension in statistical hypothesis testing: type I and type II error rates move in opposite directions. Reducing the probability of false positives (type I errors)&#8212;as Bonferroni does aggressively&#8212;necessarily increases the probability of false negatives (type II errors), meaning real effects that fail to clear the more stringent threshold go undetected. BH correction sits at a more permissive point on this tradeoff, accepting a controlled proportion of false positives in exchange for greater statistical power to detect true effects. There is no procedure that minimizes both error types simultaneously; the choice between BH and Bonferroni is ultimately a choice about which kind of mistake your scientific context can better tolerate.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #75 // An Intuitive Take on PCA For Biologists ]]></title><description><![CDATA[What's the purpose of principle component analysis and what do principle components represent from a biological perspective?]]></description><link>https://sequenceanddestroy.substack.com/p/issue-75-an-intuitive-take-on-pca</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-75-an-intuitive-take-on-pca</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 10 May 2026 10:01:05 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/aee84426-b917-43dc-8fb4-7f9a42919ac4_1432x844.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3C-9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3C-9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 424w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 848w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1272w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png" width="1456" height="308" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:308,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:277062,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196584918?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!3C-9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 424w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 848w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1272w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>Principle component analysis is a notoriously confusing topic for beginners in computational biology and applied ML. While the purpose of PCA&#8212;summarizing and compressing high-dimensional data&#8212;is relatively easy to understand, technical explanations are often overloaded with linear algebra, making them unwieldy and overwhelming. Personally, even after I had a firm grip on the math underlying PCA, I had trouble understanding what PCA was <em>really</em> doing to my data from a biological perspective. For example, after compressing 1,000 protein&#8217;s abundances into 30 principle components, what was I left with? More importantly, how should I reason about these abstract features when my ultimate goal is to <a href="https://sequenceanddestroy.substack.com/p/issue-74-a-cluster-isnt-a-category">cluster or classify</a> patients? That&#8217;s what this article is about&#8212;providing an intuitive take on principle component analysis for biologists. </p><div><hr></div><h2>Issue &#8470; 75 // An Intuitive Take on PCA For Biologists </h2><p>This article grew out of a simple question: <strong>practically speaking, what&#8217;s the purpose of principle component analysis (PCA) in computational biology and bioinformatics? </strong>The academic, and wholly unsatisfying, answer to this question is that PCA finds the directions of maximum variance in your data, transforming a set of high-dimensional correlated variables&#8212;for example, a bunch of protein&#8217;s abundances&#8212;into a smaller set of uncorrelated features, or principle components. For this reason, PCA is considered a form of dimensionality reduction, which is a smart sounding way of saying it reduces the number of input variables (features) in a dataset while retaining as much relevant information as possible. However, because lower-variance features are discarded, PCA is lossy, meaning you can never perfectly reconstruct your original dataset after it&#8217;s performed. Instead, what you&#8217;re left with is a series of orthogonal principle components starting with PC1, which captures the highest variance, then PC2, PC3, and so forth (we&#8217;ll come back to this shortly). </p><p>As a practical example, suppose we have a dataset with 50 protein&#8217;s values, each measured in an arbitrary number of breast cancer patients. This feature set corresponds to a 50-dimensional space, which while small in the grand scheme of bioinformatics analyses is impossible to directly visualize or reason about nonetheless. To make this concrete, imagine you wanted to visualize your heart rate data (a 1-dimensional data point) following a trail run. This could easily be viewed with a line graph. Now, let&#8217;s say you want to see two data points in relation to one another&#8212;heart rate vs. pace&#8212;this can be visualized on an x-y axis (i.e., a plane). Three? An x-y-z coordinate system. Anything beyond three dimensions though and we&#8217;re lost&#8212;our brains lack the ability to perceive a fourth spatial dimension and beyond, making it impossible to form a mental image of it.  </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!IZuy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!IZuy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 424w, https://substackcdn.com/image/fetch/$s_!IZuy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 848w, https://substackcdn.com/image/fetch/$s_!IZuy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 1272w, https://substackcdn.com/image/fetch/$s_!IZuy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!IZuy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png" width="1456" height="567" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:567,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:512515,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196696152?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!IZuy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 424w, https://substackcdn.com/image/fetch/$s_!IZuy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 848w, https://substackcdn.com/image/fetch/$s_!IZuy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 1272w, https://substackcdn.com/image/fetch/$s_!IZuy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98141f39-7627-4a46-9f78-1041f8a5cc83_1596x622.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Left: A 3D dataset where red, blue, and green arrows represent the first through third principle components, respectively. Right: Scatterplot after the data is reduced to 2 dimensions. (<a href="https://towardsdatascience.com/principal-component-analysis-pca-explained-visually-with-zero-math-1cbf392b9e7d/">Image source</a>). </figcaption></figure></div><p>Now, what principle component analysis allows us to do is visualize high-dimensional data in a low dimensional space. For example, we can compress our 50-dimensional proteomic data into just a few ordered components that can be viewed in two dimensions (PC1 on the x-axis, PC2 on the y-axis). Additionally, after running this analysis we can plot the cumulative explained variance per principle component, which tells us how many PCs we need to retain the majority of information contained in the original dataset. Suppose we find that 20 components explains ~90% of the variance&#8212;this would allow us to work with just 20 features per subject instead of 50, which is helpful for a few reasons. First, many analyses downstream from PCA, such as k-means clustering, work better in lower dimensional space (see footnote #1 for a detailed explanation for when we should feed principle components into clustering algorithms and when we should use raw features instead)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Second, PCA can tell you whether our data actually has a low-dimensional structure, despite existing in a high-dimensional space, or whether it&#8217;s variance is diffusely spread across features, suggesting a more dilute signal (as is often the case in biology)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CY5K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CY5K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 424w, https://substackcdn.com/image/fetch/$s_!CY5K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 848w, https://substackcdn.com/image/fetch/$s_!CY5K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 1272w, https://substackcdn.com/image/fetch/$s_!CY5K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CY5K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png" width="294" height="228.7923076923077" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:607,&quot;width&quot;:780,&quot;resizeWidth&quot;:294,&quot;bytes&quot;:75908,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196696152?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F722874e3-db25-4ac9-9cb9-1544d45cc3c6_806x654.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CY5K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 424w, https://substackcdn.com/image/fetch/$s_!CY5K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 848w, https://substackcdn.com/image/fetch/$s_!CY5K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 1272w, https://substackcdn.com/image/fetch/$s_!CY5K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03f32bee-6524-4c57-9ef3-1a21f247ef41_780x607.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">Cumulative explained variance by PCA component. </figcaption></figure></div><p>To this point, I&#8217;ve primarily been speaking about PCA from an information-centric perspective. Understanding that is step 1. But, what really confused me about principle component analyses even after I understood the above is what the principle components <em>really</em> represent. When we compress 50 proteins abundaces&#8217;s into a small handful of uncorrelated components, how should we think about these things? Additionally, if a given subject in our study has high level of PC1 compared to other subjects, what does that actually mean? </p><p>Assuming you know some linear algebra, the easiest way to think about a principle component is as a specific weighted linear combination of all your original features. For easy math, let&#8217;s say we have 5 proteins in a dataset (labeled proteins a-e). Each principle component is a weighted linear combination of  these features, which can be understood with the formula below where Z&#7522; is the &#119894;-th principal component, w&#7522;&#11388;are the weights (aka loadings) for that component, and X&#8336;..X&#8337; are the original feature&#8217;s values.</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;Z_i = w_{i1}X_a + w_{i2}X_b + w_{i3}X_c + w_{i4}X_d +  w_{i5}X_a + &quot;,&quot;id&quot;:&quot;KFZGIFNNZZ&quot;}" data-component-name="LatexBlockToDOM"></div><p>When we run PCA, every protein in the dataset gets a unique loading value, or weight, for each principle component. This loading value indicates the degree to which that protein contributes to given component and in which direction (positive or negative). If a protein has a high positive loading for PC1, it means that protein&#8217;s expression level is positively correlated with the PC1 score. Therefore, if a patient has high expression (high abundance) of that protein, the PCA algorithm calculates a positive score for them, moving them to the positive end of the PC1 axis. On the flip side, a protein with a high negative loading for PC1 is negatively correlated with the PC1 score. As a result, is a patient high abundance of that protein, they will be pushed towards the negative (opposite) end of the PC1 axis. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!uJYl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uJYl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 424w, https://substackcdn.com/image/fetch/$s_!uJYl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 848w, https://substackcdn.com/image/fetch/$s_!uJYl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 1272w, https://substackcdn.com/image/fetch/$s_!uJYl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uJYl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png" width="449" height="304.53913043478263" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:780,&quot;width&quot;:1150,&quot;resizeWidth&quot;:449,&quot;bytes&quot;:204382,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196696152?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c4f40b9-23b4-4ad3-ad96-c54482438b05_1150x780.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uJYl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 424w, https://substackcdn.com/image/fetch/$s_!uJYl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 848w, https://substackcdn.com/image/fetch/$s_!uJYl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 1272w, https://substackcdn.com/image/fetch/$s_!uJYl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d3cb7d0-e958-45a0-b1c6-327b16f06bf3_1150x780.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Patient 1 (green) has a high abundance of proteins with negative loadings for PC1 and PC2. Patient 2 (pink) by contrast has a high abundance of proteins with positive loadings for PC1 and PC2. </figcaption></figure></div><p>Now, if we take a specific linear combination of proteins (i.e., a principle component) we can determine whether it&#8217;s biologically meaningful by extracting its loadings. In simple terms, this is the weight of each protein in our original dataset for that component. We can then rank order proteins by their absolute loading values (while still being mindful of their direction) and ask &#8220;are the top contributes enriched in a particular pathway?&#8221; For example, we may see that PI3K/AKT/mTOR pathway proteins have high positive loadings for PC1, and immune checkpoint proteins have high positive loadings for PC2. This would indicate that the activation of the PI3K/AKT/mTOR signaling axis and the strength of immune checkpoint signaling are the two largest sources of variation in our data. From this, we might hypothesize that patients in the bottom right hand corner of a PC1 vs. PC2 x-y plot have the worst survival outcomes (high proliferation, low immune engagement) whereas those in the top left corner have the best survival (low proliferation, high immune engagement). </p><p>Alternatively, we may see that PI3K/AKT/mTOR pathway proteins have high positive loadings for PC1, and immune checkpoint proteins have high negative loadings for PC2, which would suggest that high PI3K/AKT/mTOR signaling and immune engagement are inversely associated in our data. Interrogating these types of outputs&#8212;rather than treating PCA as a push button analysis to be performed before clustering&#8212;is what sets a great computational biologist apart from the rest<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. </p><div class="callout-block" data-callout="true"><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing or sharing it with a friend. I regularly shared hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><strong>This raises the question, when should we feed principle components into k-means clustering and when should we use raw features (protein abundance values in this example)?</strong> A typical pipeline for clustering patients by protein expression is to z-score normalize proteins (column wise) to ensure all of them contribute equally to the analyses regardless of their original scales, then perform PCA, then k-means clustering. But, PCA isn&#8217;t always necessary (sometimes it&#8217;s even counter productive!). As a rule of thumb, PCA should be used before clustering when you have more features (proteins) than samples (patients), or when your data is noisy. So, for example, if you have 500 proteins and 10000 patients, PCA makes sense. Alternatively, you should consider using raw scaled features when the number of proteins you&#8217;ve measured is modest relative to your sample size and you want to preserve the full feature space (ex, if you have ~100 proteins and patients both). Technically, you don&#8217;t violate any statical assumptions or rules when using PCA in cases where samples &gt;&gt; features, but you do risk losing valuable signals. A protein that contributes modestly to many principle components, but strongly to none, may never make it into your top PCs, despite potentially being the most clinically relevant feature. You can never know if you just compress it away unnecessarily. This is especially important to consider when using targeted proteomics panels, like reverse phase protein arrays where we often measure just ~150 proteins, yet may have hundreds or thousands of samples (as in large clinical trials, such as the ISPY-2 neoadjuvant trial).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Biological systems don&#8217;t actually operate in ~20,000-dimensional space, even though that&#8217;s how approximately many protein coding genes we can measure. The &#8220;true&#8221; dimensionality of biology may be orders of magnitude lower&#8212;perhaps hundreds of fundamental processes like immune response, cell cycle progression, or metabolic state that manifest through coordinated expression changes across thousands of genes and proteins. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>For more on this topic, check out <a href="https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone">What Actually Makes Someone Good at ML in Computational Biology?</a></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #74 // A Cluster Isn't A Category ]]></title><description><![CDATA[Unsupervised clustering and supervised classification are not analogous, but when combined appropriately can produce outsized effects.]]></description><link>https://sequenceanddestroy.substack.com/p/issue-74-a-cluster-isnt-a-category</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-74-a-cluster-isnt-a-category</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Wed, 06 May 2026 18:02:10 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/732dfcc5-eb59-473d-a851-8782454ed129_1434x848.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3C-9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3C-9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 424w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 848w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1272w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png" width="1456" height="308" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:308,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:277062,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196584918?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3C-9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 424w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 848w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1272w, https://substackcdn.com/image/fetch/$s_!3C-9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d726996-303b-43d4-baac-4b9df465be4e_1692x358.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><h2>Issue &#8470; 74 // A Cluster Is Not A Category </h2><p>The distinction between classification and clustering is typically introduced early in computational biology and applied ML courses to help differentiate supervised from unsupervised learning. As a general heuristic, classification predicts outcomes, such a therapeutic response, while clustering finds patterns in data. Understanding this distinction is important because these two approaches both require different inputs and answer different biological questions. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lJzJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lJzJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 424w, https://substackcdn.com/image/fetch/$s_!lJzJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 848w, https://substackcdn.com/image/fetch/$s_!lJzJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 1272w, https://substackcdn.com/image/fetch/$s_!lJzJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lJzJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png" width="536" height="153.23845193508114" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:229,&quot;width&quot;:801,&quot;resizeWidth&quot;:536,&quot;bytes&quot;:46508,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196584918?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c71bead-060f-4e93-b432-31b919fbf7ac_801x290.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lJzJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 424w, https://substackcdn.com/image/fetch/$s_!lJzJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 848w, https://substackcdn.com/image/fetch/$s_!lJzJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 1272w, https://substackcdn.com/image/fetch/$s_!lJzJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F34ea601f-c4e5-4d58-ac2a-cb7263e5ffcd_801x229.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">Classification vs. Clustering. Same data points, different purpose (<a href="https://analystprep.com/study-notes/cfa-level-2/quantitative-method/supervised-machine-learning-unsupervised-machine-learning-deep-learning/attachment/cfa-level-2-classification-vs-clustering/">image source </a>). </figcaption></figure></div><p>Classification requires training data with labeled examples&#8212;such as patient&#8217;s protein abundances and treatment outcomes&#8212;then learns a decision boundary which is applied to new samples (unseen during training), thus answering the  question: &#8220;given what we know about these labeled examples, which category does this new sample belong to?&#8221; From this problem statement its clear that classification models are appropriate when categories are defined in advance and your goal is to assign new instances to these categories (for example, predicting whether a tumor will response to a drug or calling a variant pathogenic or benign). </p><p>Clustering by contrast asks, &#8220;what structure exists in our data that we didn&#8217;t specify in advance?&#8221; When a k-means clustering algorithm groups patients by their gene expression or protein abundance it&#8217;s finding underlying patterns without being told what to look for. As a result, clustering is most useful for discover&#8212;for example, identifying new candidate genomic subtypes in breast cancer, which can latter be clinically validated. However, we need to be careful with how we interpret clustering results, remembering that a cluster is not a category; it&#8217;s a region of high density a particular feature space, defined by a particular distance metric, found by a particular algorithm initialized with particular parameters. Changing any of those will change the clusters. </p><p>This distinction is particularly important for computational biologists, as bioinformatics analyses often blur the lines between clustering and classification. Take single-cell RNA-sequencing for example, where unsupervised clustering is used to define cell types and identified clusters are subsequently treated as labels for classification models, resulting in a circular argument. This isn&#8217;t always a problem&#8212;such as when cell-type clusters are validated by experts or <a href="https://sequenceanddestroy.substack.com/p/issue-61-orthogonal-validation-over">orthogonal measurements</a> prior to being used as labels&#8212;but it&#8217;s easy to make mistakes here, assuming clusters represent something &#8220;real&#8221; and concrete when they may just be statistical artifacts (I wrote about this previously in the post below). </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OP3y!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OP3y!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 424w, https://substackcdn.com/image/fetch/$s_!OP3y!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 848w, https://substackcdn.com/image/fetch/$s_!OP3y!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 1272w, https://substackcdn.com/image/fetch/$s_!OP3y!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OP3y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png" width="502" height="423.50545454545454" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1100,&quot;resizeWidth&quot;:502,&quot;bytes&quot;:236403,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/196584918?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fca806202-b980-488c-839e-3bc2a55be44d_1100x928.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OP3y!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 424w, https://substackcdn.com/image/fetch/$s_!OP3y!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 848w, https://substackcdn.com/image/fetch/$s_!OP3y!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 1272w, https://substackcdn.com/image/fetch/$s_!OP3y!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff37c60f5-f433-4e71-bd52-ef4efaf85482_1100x928.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Now, that we&#8217;ve covered the delineation between clustering and classification, I want to pivot to discussing how these methods can be used synergistically. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Suppose we want to build a classifier to predict the following categorical labels assigned to patients by a molecular oncologist: treatment responder, treatment non-responser, partial responser (stable cases). In this case, let&#8217;s assume we have a labeled dataset where index rows are samples (i.e., patients), columns 0-99 are z-score normalized protein abundance values, and the final column contains one of the three aforementioned category labels for each sample. Before we even consider training a classification model, it&#8217;s worth asking a prior question: do the labels assigned by the molecular oncologist actually capture real biological variation, or are they arbitrary clinical distinctions imposed on a continuous underlying distribution? </p><blockquote><p><strong>Note:</strong> This is exactly the type of question that differentiates <a href="https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone">someone who is good at ML in computational biology</a> from someone who can run push-button analyses. The separator from good and great practitioners is rarely the code they write&#8212;it&#8217;s how they interrogate outputs and what they bring to that interrogation that no automated pipeline can replicate. </p></blockquote><p>In order to answer this question, you can run k-means clustering (with the &#8220;optimal&#8221; number of clusters) on your z-score normalized proteomic features along without touching the category labels. Then, you can use Pandas&#8217; cross-tab function to compare the resulting cluster assignments against patients known response categories. If responders predominantly group in one cluster with non-responders and partial-responders in another, the unsupervised learning model and clinical annotations are well aligned suggesting that the oncologist&#8217;s response categories are biologically &#8220;real&#8221; (i.e., they don&#8217;t represent arbitrary boundaries imposed by the human mind). </p><p>If on the other hand,  responders, non-responders, and partial-responders are distributed across clusters (co-mingling with one another), we have to entertain one of two possibilities: the proteomic features you&#8217;ve measured don&#8217;t capture the relevant biology or the response categories don&#8217;t reflect clean molecular distinctions (or both). In either case, training a classifier would be a waste of time. As a result, we can consider clustering and cross-tabulating to be a sanity check on whether supervised learning (classification) is even warranted. </p><p>Assuming our cross-tabulation results look good (i.e., clusters recapitulate classifications), we&#8217;re in a good position to build a classifier that can generalize to new, unseen, samples (if you&#8217;re interested in learning the details about this process check out <a href="https://github.com/evanpeikon/Build_A_Biomarker">How to Develop Predictive Biomarkers</a>). To pull this off we&#8217;d take our labeled data&#8212;protein features as input, response category as the target&#8212;then train a model to learn the mapping that represents a linear or non-linear combination of proteins that best predict treatment response from our labeled examples (training data). </p><p>The ability to generalize to new unseen data is one of the qualities that sets classification apart from clustering. A trained classifier learns a decision boundary that can be applied to patients it has never encountered. When we add a new patient&#8217;s baseline protein abundance data through a k-means model on the other hand, we&#8217;ll get a cluster assignment&#8212;importantly, one based on proximity to centroids that shifts when we add new data or alter the distance metric. Clinically, this is the difference between a usable tool and a novel research finding. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>In summary, clustering can be used to validate whether label assignments in your data are consistent with biological reality. Classification, by contrast, can be used to build a model that generalizes to new unseen data. The two methods reinforce each other and work synergistically. However, used interchangeably or without understanding what question each one answers, they produce findings that are nonsensical, or misleading. </p><div><hr></div><div class="callout-block" data-callout="true"><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p></div>]]></content:encoded></item><item><title><![CDATA[Issue #73 // The Orchestrator's Edge]]></title><description><![CDATA[On coding literacy, domain expertise, and how computational biologists stay irreplaceable in the age of AI]]></description><link>https://sequenceanddestroy.substack.com/p/issue-73-the-orchestrators-edge</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-73-the-orchestrators-edge</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 26 Apr 2026 10:00:59 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/cb75dc5b-4f2c-4ca1-8cd3-c7269abc4a10_1388x818.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2><strong>Issue </strong>&#8470; 73 // The Orchestrator&#8217;s Edge</h2><p>As a dry lab scientist, I occasionally worry about AI&#8217;s impact on job availability or what it will mean to be a researcher in a world where machines begin to make scientific discoveries that humans can&#8217;t understand.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Despite being skeptical of AI companies&#8217; strategic messaging&#8212;which I previously wrote about in <a href="https://www.connectedideasproject.com/p/how-to-win-markets-and-influence?utm_source=publication-search">how to win markets and influence policy</a>&#8212;I still find myself thinking "what if the pool of well paying computational biology jobs dries up?" or "what if I spent years earning a PhD in a field that&#8217;s soon to be consumed by agentic AI?"</p><p>Yet, ironically, on my best days&#8212;the ones where I&#8217;m most engaged with my work&#8212;I use AI to debug complex programming scripts, help refactor spaghetti code, and  summarize documentation for unfamiliar Python libraries. Inevitably this leads to cognitive dissonance over using the very system whose negative externalities I&#8217;m worried about to augment my own capabilities. It&#8217;s not dissimilar to professional cyclists who rail against PEDs, then use them anyway because everyone else does&#8212;reinforcing their necessity and entrenching their influence. </p><p>I suspect I&#8217;m not the only one with mixed feelings about AI&#8217;s role in science and society, yet I&#8217;ve seldom seen this perspective reflected in a way that resonates for me. This is my motivation for writing this piece&#8212;not to shape the narrative around AI in computational science or influence your opinion, but to clarify my own thinking while grappling with these ideas. As a result, you can expect this piece to move between the personal and the practical&#8212;from my current perspective to where I see things heading, and from my thoughts on future-proofing my role as a scientist to the skills I&#8217;m actively trying to develop.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3><strong>When Perfect Code Produces Imperfect Science Redux </strong></h3><p>In <a href="https://sequenceanddestroy.substack.com/p/when-perfect-code-produces-imperfect?utm_source=publication-search">when perfect code produces imperfect science</a> I hypothesized that the most successful bioinformaticians in the next few years will be the ones who understand computation deeply enough to decompose complex biological questions into analytical components, orchestrate AI tools to implement their vision, and maintain the skepticism to question whether their code truly answers their biological questions.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>  What I was trying to articulate in this piece was that rather than AI replacing computational biologists wholesale, I believe it will instead push us to orchestrate analyses at higher levels of abstraction. Eleven months later, the New York Times Magazine published an article that came to the same conclusion&#8212;<a href="https://www.nytimes.com/2026/03/12/magazine/ai-coding-programming-jobs-claude-chatgpt.html">Coding After Coders: The End of Computer Programming as We Know It</a>&#8212;where the author Clive Thompson writes:</p><blockquote><p>"With A.I., though, programmers ascend to an even higher level of abstraction. They describe, in regular language, what the program should do, and the agents translate that idea &#8212; that human <em>intent </em>&#8212; into code. Writing software no longer means mentally juggling the nuances of a language like Python, say, or JavaScript or Rust. Coding no longer involves messing up an algorithm and then trying to figure out where your error lies. That part, too, has been abstracted away." </p></blockquote><p>Taking it a step further, AI isn&#8217;t just pushing programmers to work at higher levels of abstraction&#8212;it&#8217;s changing the selection criteria for what makes a good programmer in general, especially in fields like computational biology that require significant domain expertise in addition to scientific computing skills. In a recent piece titled <a href="https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone">what actually makes someone good at ML in computational biology?</a>, I expanded on these ideas, discussing how the separator between good and great ML practitioners in comp bio isn&#8217;t the code they write, but how they interrogate its output and what they bring to that interrogation that no automated pipeline can replicate. Then, in <a href="https://sequenceanddestroy.substack.com/p/issue-70-why-interpretable-models">why interpretable models outperform black boxes in biology</a> I wrote:</p><blockquote><p>"Accuracy, in a sense, is a commodity that can be bought. Given enough money, you can buy your way to a better model&#8212;more high quality training data, more compute, larger teams, and greater access to proprietary datasets. This gives early movers a huge competitive advantage, but given enough resources a well-funded competitor can close almost any accuracy gap. Interpretability, on the other hand, doesn&#8217;t work that way&#8212;it can&#8217;t be purchased at scale. Instead, it requires deep domain expertise, biological intuition, and the kind of hard-won insight that you can&#8217;t list on a procurement order&#8212;it&#8217;s a bit of a dark art, and that&#8217;s what makes it defensible."</p></blockquote><p>These pieces both point to the same idea&#8212;that the value of computational biologists isn&#8217;t shrinking, but shifting. The skills that once set a computational biologist apart&#8212;like writing pipelines, automating workflows, keeping a lab&#8217;s analysis infrastructure running&#8212;are no longer sufficient on their own. Instead, we&#8217;re living in a world that places high value on combining computational thinking in a broad sense with biological knowledge and mechanistic understanding.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> </p><p>This shift from implementation to orchestration may suggest to some that learning to code is a waste of time in 2026, and in years to come. I&#8217;d argue the opposite: coding literacy has never mattered more, even when AI is capable of writing most code for major projects.  When you&#8217;re no longer capable of tracing every line of AI-generated code back to your own reasoning, your ability to read, interrogate, and sanity-check what a machine produces becomes the last line of defense against silent compounding errors, which ultimately determines the validity of downstream scientific results. </p><p>The case for learning to code isn&#8217;t, then, to compete with generative AI. It&#8217;s to stay fluent enough to know when you can trust it. And yet, for a growing number of students and early-career researchers that case is increasingly difficult to make. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3><strong>The Psychology of Learning to Code In the Age of AI</strong></h3><p>Learning to code has never been more accessible than it is right now&#8212;the resources are better, more abundant, and more personalized than ever. Yet, for many, the motivation to learn gets harder to muster by the month. This must be how medical  residents training to become radiologists felt ten years ago, after Geoffrey Hinton stated the following:</p><blockquote><p>"If you work as a radiologist, you&#8217;re like the coyote that&#8217;s already over the edge of the cliff but hasn&#8217;t yet looked down, so he doesn&#8217;t realize there&#8217;s no ground underneath him. I think we should stop training radiologists now. It&#8217;s just completely obvious that within five years, deep learning is going to do better than radiologists&#8230; we&#8217;ve got plenty of radiologists already"</p></blockquote><p>For residents in the middle of their training, that must have been a destabilizing thing to hear: years of investment, a clear career trajectory, suddenly reframed as a sunk cost. As it turned out, Hinton was wrong. Radiologists are in higher demand than ever, with average US salaries reaching $673k as of 2026.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> But knowing that now doesn&#8217;t tell you what it felt like to be a radiology trainee then&#8212;absorbing that verdict mid-training, with no way to know whether the expert making it was right.</p><p>That feeling has a contemporary equivalent. I recently spoke to a fellow PhD student who described a feeling of nihilism about their technical skill development&#8212;they want to improve, but easily become demotivated when they imagine AI always outpacing them. What troubled them wasn&#8217;t just the capability gap, but something harder to articulate: that orchestrating AI to execute tasks made their own involvement feel interchangeable. "If I can prompt Claude Code to do this," they said, "so can anyone else. What am I actually contributing?"</p><p>I&#8217;ve grappled with the same thought. Like the cyclist using PEDs, I simultaneously feel it&#8217;s necessary to use AI to keep pace with colleagues and their expectations, and that doing so diminishes the value of my own contributions. Yet, this obscures what&#8217;s really happening. I still need to make the key decisions during a project, whether using an AI co-pilot or not: what questions to ask, how to frame the analyses, what the results mean biologically, and how to situate findings within the broader literature. These are the intellectual contributions that matter. Code generation is a tool, the same way statistical software is a tool. What you cannot outsource is the scientific reasoning.</p><p>Knowing this is one thing. Believing it is another. It&#8217;s easy to feel effective, but  fraudulent, when using AI to augment your workflow. I&#8217;d argue that feeling has two distinct sources. First, we sometimes use AI to rapidly prototype code, and end up using tools or outputs we don&#8217;t understand well enough to audit&#8212;this is knowledge debt, and it&#8217;s a real problem that needs to be actively managed. The second source is a psychological artifact of the AI era&#8212;the sense that because AI is doing the heavy lifting, you&#8217;ve become a passenger rather than a contributor, that your skills no longer count. This is a valid feeling, but it&#8217;s worth reframing. </p><p>Consider a film director who relies on a cinematographer to shoot the footage for their movie. Nobody questions whose vision the film represents, or whether the director&#8217;s creative contribution is diminished by not having operated the camera themselves. The judgment about what story to tell, how to frame it, and what it means is still entirely the director&#8217;s. Similarly, the fact that AI can execute code faster doesn&#8217;t change who is responsible for the question and the interpretation. </p><p>The knowledge debt problem and the psychological artifact are easy to conflate because they produce similar surface symptoms&#8212;both make you feel like you don&#8217;t fully own your work. But their origins are different. Knowledge debt is epistemic: there are things you genuinely don&#8217;t understand about what your pipeline is doing, and that gap has real consequences for scientific validity. The psychological artifact is emotional: you understand the work, you made the key decisions, but the feeling of authorship hasn&#8217;t caught up with the reality of how the work was produced. Treating one as the other makes both worse&#8212;auditing code you already understand doesn&#8217;t resolve the feeling of being a passenger, and reframing your mindset doesn&#8217;t close a genuine gap in your understanding of what your model did.</p><p>Underneath all of this sits a harder question though. If AI gets better not just at writing code but at reasoning through problems, what exactly is left for the scientist who is still early in their career, still building the intuition that&#8217;s supposed to be irreplaceable? </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3><strong>The Role of the Junior Scientist</strong> </h3><p>In the previous section I argued that code generation is a tool in the same way that statistical software is a tool, and that scientific reasoning is both what matters most and the skill that cannot be outsourced. But this raises a harder question: what happens if AI advances to the point where it can genuinely reason through problems&#8212;not pattern-match at scale, but formulate hypotheses, design experiments, and interpret results the way a scientist does? The idea that AI is a bicycle for the mind is a compelling one. But if the bicycle eventually thinks for itself, what remains for the human rider?</p><p>The question of whether reasoning can be outsourced to machines is a debate. It&#8217;s also the thesis behind multiple companies developing "AI scientists". I don&#8217;t know if this will ever be possible in the way it&#8217;s advertised. I&#8217;m also not sure it matters. Take the following example: you bring Einstein into your company and ask him to make it better. In order to direct him usefully, you need to understand the problem well enough to know what you&#8217;re asking him to solve. The same logic applies to AI. Telling it to "do research" produces nothing of value without specificity. You have to know what problem is worth solving, roughly how it should be approached, and how the work makes contact with the real world. AI may find a better implementation than what you had in mind, but you provide the scaffold that allows it to realize its potential. This points to the following conclusion: the orchestrator role isn&#8217;t a consolation prize&#8212;it&#8217;s where the real intellectual work lives.</p><p>If the orchestrator model is where things are headed, a reasonable concern follows: what happens to junior scientists?  If a senior computational biologist can just orchestrate AIs and architect a plan, what need is there for a junior scientist? This is a legitimate problem, though I also think it overlooks the complexity of actually running companies and all of the different odds and ends you can have people work on.</p><p>While there may be a dearth of jobs for junior scientists in established companies in the short term, I see their career prospects increasing with time. Here&#8217;s why: historically, a single computational biologist or data scientist at a small company was rarely cost-effective. The value these roles generate typically required teams, which confined them to large pharma or well-funded biotechs. But if one scientist with AI tools can produce what previously required three to five, the economics change for a much broader set of employers. Small companies that could never justify building out a computational team may now be able to justify hiring one strong person. The total number of companies that can deploy this kind of expertise has expanded&#8212;and with it, potentially, the total number of roles available, even if the headcount per company remains low.</p><p>I&#8217;ve seen this play out myself. Over the past decade, I&#8217;ve had contact points in many companies and have seen ways that just a little bit of data science expertise can take them from good to great, but there was never an ROI to building out the team to make that doable&#8212;conversations with founders have told me as such. For a concrete example, take a hardware product like an EMS device and imagine how integrating biosensor feedback for closed-loop modulation can make it way better. For example, utilizing real-time muscle oxygenation data to adjust the stimulus magnitude and frequency. The value proposition is there, but before now making that happen was very expensive. I&#8217;ve had some of these same companies reach out to me in recent months asking if I can help them with these same types of problems. Before, some of these projects would have required at least a team of three people and a year of dedicated effort, often at a minimum cost of exceeding $225k. Now, I can run one of these projects end to end by myself in a few months, drastically reducing costs and making it a real option.</p><p>A lot of companies can benefit from this type of work. A junior computational biologist making $75k per year can bring in more value than they cost, especially for smaller companies. Additionally, the number of more established companies that can justify 2-3 senior scientists have multiplied, creating more jobs in total. That said, there&#8217;s a devil&#8217;s advocate case worth taking seriously. The idea that small-to-mid sized companies will absorb displaced talent assumes those companies can identify the need, afford even one person, and that the value is legible to non-technical founders. In practice, demand doesn&#8217;t materialize automatically&#8212;a lot of small companies don&#8217;t know what they don&#8217;t know, and they won&#8217;t go looking for a computational biologist or data scientist if they&#8217;ve never had one. The demand isn&#8217;t automatic; it has to be created or surfaced.</p><p>This argument is fair, and it points to something important: as the comp biologist, you need to be diligent about networking, writing, and finding these opportunities. Eventually, the market will sort this out once the concept is proven, but in the short term you need to make these opportunities for yourself. I&#8217;ve given this advice to junior scientists before&#8212;they may be the best in the world, but no one is going to know that if they don&#8217;t prove it. Writing increases your surface area for luck. This also ties into the idea of making sure you know where your paycheck comes from, which Sean Goedecke articled well in an article titled <a href="https://www.seangoedecke.com/where-the-money-comes-from/">Knowing where your engineer salary comes from</a> (i.e., if you cannot clearly articulate your value, no-one else will understand it either). </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>How to Uniquely Add Value in the Age of AI</h3><p>Given everything above, what does a concrete training regimen actually look like? The following is what I&#8217;d focus on&#8212;not as an exhaustive list, but as the highest-leverage investments for a computational biologist trying to build durable value in an AI-augmented field.</p><p><strong>Practice the scaffold, not the implementation.</strong> The step AI is worst at is also the most important one: taking a vague biological question and decomposing it into a tractable computational problem. This is where projects are won or lost before a single line of code is written. Spend deliberate time here&#8212;not just asking "how do I analyze this data" but "what would a convincing answer to this question actually look like, and what would falsify it?" That kind of problem formulation is a skill, and like most skills it improves with intentional practice.</p><p><strong>Build cross-domain pattern recognition.</strong> Reading broadly isn&#8217;t just intellectually satisfying&#8212;it&#8217;s also strategically useful. Complexity science, ecology, economics, and network theory all share structural similarities with problems in computational biology. The scientist who recognizes that a gene regulatory network behaves like a market will generate hypotheses that an AI trained on biology literature alone will not. This has become something I&#8217;d consider an advantage in my own work. Operating across precision oncology, biodefense, and human performance can look like a liability from the outside&#8212;the jack of all trades, master of none. In practice, I&#8217;m constantly importing ideas from one discipline into another, and the cross-pollination produces things that a narrower focus wouldn&#8217;t.</p><p><strong>Get good at scoping and saying no.</strong> In the embedded scientist model, a significant portion of your value comes from knowing which projects will hit a dead end before the team invests six months finding out. This is one of the most valuable skills I developed in my time at NNOXX. There were often app features or use cases that seemed interesting, and even viable, superficially, but upon working on them we quickly realized they couldn&#8217;t be reduced to practice. Over time, it became easier and easier to spot these things before wasting days or weeks working on them. This skill is learnable, but mostly through deliberate reflection on past projects rather than doing more projects faster, which is ironically what AI encourages. Build the habit of asking, after every project, where the dead end was and how early the signal was visible.</p><p><strong>Develop a point of view.</strong> The scientists who will matter most in an AI-augmented world are the ones with strong, defensible opinions about what&#8217;s biologically true and why. The person who can say "this result is almost certainly an artifact because of X biological reason" and be right is providing a something scare. Cultivate opinions. Be willing to be wrong in specific, correctable ways rather than vague, unfalsifiable ones.</p><p><strong>Learn enough about how the models work to know when not to trust them.</strong> Not AI-research-level depth, but enough to understand where LLMs hallucinate confidently, where they fail on compositional reasoning, and where their training data is thin. This is what separates someone who uses AI well from someone who inadvertently introduces AI errors into published science.</p><p>A concrete example from my own work illustrates why this matters. AI is consistently poor at interpreting phosphoprotein data. If your data shows EGFR T654 is upregulated in a breast cancer sample, a model will generate a perfectly coherent explanation about EGFR hyperactivation driving downstream signaling&#8212;except EGFR T654 is an inhibitory phospho site. The same failure mode appears with PTEN S380. When a model encounters PTEN, it defaults to PI3K inhibition and reduced AKT signaling. It doesn&#8217;t recognize that elevated PTEN S380 may indicate the opposite. The reason is straightforward&#8212;most papers discuss proteins without phosphorylation-site context, so the model has no basis for the distinction. The output sounds like expertise. It isn&#8217;t. This is the kind of failure that domain knowledge catches and automated pipelines don&#8217;t, and it&#8217;s a good illustration of why fluency in the underlying biology remains non-negotiable even as the code becomes increasingly optional.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>Closing Thoughts</h3><p>I started this piece admitting to cognitive dissonance&#8212;using the very system I&#8217;m conflicted about to do my best work, feeling simultaneously that it&#8217;s necessary and that it diminishes something. I haven&#8217;t resolved this tension.  What I&#8217;ve come to instead is something more like a working position. AI is a force multiplier, which means it amplifies what you bring to it. If you bring shallow domain knowledge and poor problem formulation, it will help you produce shallow work faster. If you bring genuine biological intuition, hard-won judgment about what questions are worth asking, and the ability to recognize when an output is wrong it will extend your reach in ways that weren&#8217;t possible before.</p><p>The scientists who flourish in this environment won&#8217;t be the ones who resisted AI or the ones who outsourced their thinking to it. They&#8217;ll be the ones who stayed fluent enough to direct it, skeptical enough to audit it, and grounded enough in the real world to know what it still can&#8217;t do. That&#8217;s a narrower target than it sounds, and it requires deliberate effort to hit. But it&#8217;s also, I think, a genuinely exciting place to be building from. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div><hr></div><h3>**A Practical Addendum: On Becoming the Go-To Person for Companies That Don&#8217;t Know They Need You</h3><p>One practical addendum worth making explicit because it&#8217;s the piece most scientists skip entirely. Earlier I argued that demand for computational talent won&#8217;t materialize on its own&#8212;it has to be created. Here&#8217;s what that actually looks like in practice.</p><p>The first thing to recognize is that this is a visibility and translation problem, not a credentials problem. The companies you&#8217;re targeting can&#8217;t search for what they don&#8217;t know exists. A founder running a hardware company with a biosensor problem isn&#8217;t browsing LinkedIn for computational biologists&#8212;they don&#8217;t have the vocabulary for what they need. Your job is to show up in their world, speaking their language, before they know to look for you.</p><p>That starts with developing a sharp, non-technical articulation of what you do. Not "computational biology and machine learning" but something closer to &#8220;I help companies that have biological data or biological products figure out what that data is actually telling them, and build systems around it.&#8221; The more precisely you can describe the business problem you solve rather than the technical method you use to solve it the more recognizable you become to founders who have the problem but not the vocabulary. This is harder than it sounds. Most scientists are trained to lead with methods. Inverting that instinct takes practice.</p><p>The next step is creating artifacts that demonstrate value before any conversation happens. Writing publicly&#8212;a newsletter, a LinkedIn post, a project summary with confidential details obscured&#8212;does more than any credential. A founder who reads "I helped a hardware company integrate biosensor feedback for closed-loop modulation, cutting what would have been a $225K team effort down to a few weeks of one person&#8217;s time" will share that with every founder they know with a similar problem. Concrete, specific, outcome-oriented writing is the highest-leverage marketing available to someone in this position, and few people do it well without practice.</p><p>Then get into the rooms where founders talk to each other. Accelerators, industry-specific conferences, founder communities&#8212;not as a vendor, but as a participant with something to contribute to the conversation. The moment you solve one problem for one company inside a network, word travels in ways that no amount of cold outreach replicates. Most of the relationships that actually lead somewhere don&#8217;t start from a pitch&#8212;they start from a warm referral from someone who saw the value firsthand.</p><p>The timeline here is long, which makes it easy to deprioritize. It shouldn&#8217;t be. I can trace my role as co-founder at NNOXX in 2020 directly to an event at Metier coffee in Seattle in 2015. Several of the consulting projects I&#8217;m working on now are direct offshoots of conversations I had at in-person events two or three years ago. The payoff isn&#8217;t linear and it isn&#8217;t fast, but it compounds&#8212;and it&#8217;s extraordinarily difficult to replicate through any shortcut.</p><p>Finally, make it easy for non-technical people to refer you. Most of your best leads will come from people who don&#8217;t fully understand what you do but know you&#8217;re useful. Give them a sentence they can repeat without distorting it. &#8220;She&#8217;s the person you call when you have data and don&#8217;t know what it&#8217;s telling you&#8221; travels further than an accurate technical description, because it&#8217;s repeatable.</p><div><hr></div><p><strong>As always, I&#8217;d like to hear what other scientists in this space are thinking. Where does this resonate, and where does it miss? Feel free to<a href="https://www.evanpeikon.com/contact"> reach out</a>.</strong> </p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>See Asimov Press&#8217;s piece on this topic, <a href="https://www.asimov.press/p/legibility-problem">The Legibility Problem</a>. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Specifically, I wrote: "Much like how experienced PIs developed their scientific intuition through years at the bench, computational biologists will need to develop their &#8220;computational taste&#8221; by understanding the underlying structures of code, data, and algorithms. This taste will allow them to immediately sense when an analysis seems off because some underlying biological assumption has been violated. Even with powerful LLMs, learning to code proficiently remains essential&#8212;not because you&#8217;ll out code AI, but because coding literacy is your passport to directing these powerful tools toward meaningful biological discovery."</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>This is something LLMs still struggle at, and frankly have not gotten much better at over the last few years for reasons partly explored in<a href="https://sequenceanddestroy.substack.com/p/molecular-moonlighting?utm_source=publication-search"> Molecular Moonlighting</a>. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Robert Watcher wrote a good article on this topic, titled <a href="https://robertwachter.substack.com/p/why-do-you-still-have-a-job">"Why Do You Still Have a Job?"</a></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy is a reader-supported publication. To receive new posts consider becoming a subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div></div></div>]]></content:encoded></item><item><title><![CDATA[Issue № 72 // Building A Statistical Pipeline For Group Comparisons]]></title><description><![CDATA[A practical guide to group comparisons in bioinformatics and computational biology]]></description><link>https://sequenceanddestroy.substack.com/p/issue-72-building-a-statistical-pipeline</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-72-building-a-statistical-pipeline</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 19 Apr 2026 10:03:29 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/40753670-0d06-415a-9bea-b4c484e9e90e_1354x812.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2>Issue &#8470; 72 // Building A Statistical Pipeline For Group Comparisons</h2><p>I recently received a question about whether ANOVA is the best catch-all test for detecting differences between groups. The student asking this question wanted to build a pipeline with simple logic&#8212;the idea being that if ANOVA is the most versatile option, you could reduce code complexity and avoid needing fallbacks for situations like two-group comparisons or non-parametric data. The question also touched on whether you could bypass ANOVA entirely and build a pipeline around post hoc testing to simplify things further. This is the second time I&#8217;ve gotten essentially the same question, so I&#8217;m addressing it here for future reference.</p><p>Importantly, this is less of a pure statistical question and more of a practical one&#8212;it&#8217;s asking how firm the rules underlying these statistical tests are, and how far we can bend them before they break. <strong>My response is as follows:</strong></p><p>An ANOVA compares the means of three or more groups to determine if they are significantly different from one another, with the assumption that the data is normally distributed and has an ~equal variance. Practically speaking though, ANOVAs are mildly robust to variations in normality, especially with large sample sizes which is often the case in bioinformatics analyses. However, when data really isn&#8217;t normally distributed and you want to compare three or more groups, a Kruskal-Wallis H-test can be used in place of ANOVA. Notably, this test analyzes ranks instead of means, and is therefore robust to non-normal distributions. This allows us to start with some simple logic when building an analysis pipeline:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;fda5aa96-c281-4eea-84b3-8f4f92a2b93d&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">Comparing 3+ groups?
&#9500;&#9472;&#9472; Normally distributed &#8594; ANOVA
&#9492;&#9472;&#9472; Not normally distributed &#8594; Kruskal-Wallis H-test</code></pre></div><p>Suppose we&#8217;re working in a scenario where the data is always normally distributed can we then use ANOVA as a catch-all? After all, an ANOVA could work with just two groups, which would make the code much simpler, so what&#8217;s the problem? </p><p>Technically an ANOVA can be used to compare the means of two groups, but it&#8217;s not ideal. Assuming data is normally distributed, a t-test would be the best bet in this scenario as the calculation is more straightforward (which itself isn&#8217;t a great reason, imo) and its statistical power&#8212;the ability to detect true differences&#8212;is higher<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Alternatively, given two groups and non-normal data, a Wilcoxon rank sum test is a good choice (this is often an ideal test when comparing protein abundance between two groups, as this data is seldom normally distributed in my experience). Now, we have even further branching logic:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;markdown&quot;,&quot;nodeId&quot;:&quot;669da591-c340-4184-a8e2-cba15e81f308&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-markdown">Comparing 2 groups?
&#9500;&#9472;&#9472; Normally distributed &#8594; t-test
&#9492;&#9472;&#9472; Not normally distributed &#8594; Wilcoxon rank-sum test

Comparing 3+ groups?
&#9500;&#9472;&#9472; Normally distributed &#8594; ANOVA
&#9492;&#9472;&#9472; Not normally distributed &#8594; Kruskal-Wallis H-test</code></pre></div><p>Going back to ANOVA, this time addressing the portion of the question about post-hoc testing and whether we can bypass ANOVA entirely. If an ANOVA detects a significant difference between group means, post-hoc testing should be performed&#8212;the reason for this is that an ANOVA is an omnibus test, meaning it confirms that significant differences exist somewhere (this is also true of the Kruskal-Wallis H-test). But, it doesn&#8217;t give you specifics. For example, it may say there is a difference in fragment length ratios between groups A, B, C, and D, but we don&#8217;t know which of those group means actually differ. Figuring this out is the purpose of post-hoc testing. For simplicity, there are two easy options for post-hoc testing: Tukey&#8217;s Honest Significant Difference (HSD) test, and pairwise t-tests with Benjamini-Hochberg (BH) correction to control for the false discovery rate.</p><p>This raises the question though, why bother doing the ANOVA to begin with if we can just jump straight to post-hoc testing? The primary reason for this is that we want to control the type I error rate, which is the chance of making a false positive (ie, saying a difference exists between group means when none exists). If you just run a bunch of pairwise t-tests, for example, you increase your chances of falsely finding a significant difference between groups<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. Aside from that, there are also cases where an ANOVA can detect significant differences that a post hoc test, like pairwise t-tests of Tukey&#8217;s HSD cannot. In these cases, the overall group variation may be statistically significant, but specific pairwise differences are too small or noisy to meet the strict thresholds that post-hoc tests apply to control type I errors (thus making it harder to achieve significance). Finally, if you&#8217;re using Tukey&#8217;s HSD for post hoc testing specifically, ANOVA is a prerequisite assumption. In other words, Tukey&#8217;s HSD assumes the omnibus test&#8217;s results were significant and running it after a non-significant ANOVA (or without an ANOVA at all) violates the procedure. This is both considered methodologically unsound and is considered &#8220;fishing&#8221; for results, which is ill advised.</p><p>So, from a pipeline standpoint, our logic is now increasingly complex:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;4b2c12a8-5aef-4a3c-843f-32cf8f3b8c96&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">Comparing 2 groups?
&#9500;&#9472;&#9472; Normally distributed &#8594; t-test
&#9492;&#9472;&#9472; Not normally distributed &#8594; Wilcoxon rank-sum test

Comparing 3+ groups?
&#9500;&#9472;&#9472; Normally distributed &#8594; ANOVA
&#9474;   &#9492;&#9472;&#9472; Significant &#8594; post hoc test (Tukey's HSD or pairwise t-test w/ BH correction)
&#9492;&#9472;&#9472; Not normally distributed &#8594; Kruskal-Wallis H-test
    &#9492;&#9472;&#9472; Significant &#8594; Dunn's test</code></pre></div><p>Whether you choose to use a pairwise t-test with BH correction or Tukey&#8217;s HSD after ANOVA comes down to a number of factors, part of which is personal preference. Personally, I find Tukey&#8217;s HSD more useful and easier to implement as it compares all possible pairs of group means and controls for the family-wise error rate (the probability of committing one or more Type I errors when performing multiple statistical tests within a single study) making it an &#8220;exact&#8221; test for determining which specific groups are different. At the same time Tukey&#8217;s HSD also limits type II errors (false negatives) by maintaining high statistical power.</p><p>However, there may be cases where you don&#8217;t care to compare all groups means, and may only want to perform a small number of predetermined tests, especially if you have a lot of groups. This scenario tends to be less common in my own work, but in these cases a paired t-test with BH correction is favorable, as it&#8217;s statistical power is higher, and there is a low risk of type I errors due to multiple comparisons (again, because only a small number of planned comparisons are being performed).</p><p>Now, because we&#8217;re biologists, we don&#8217;t just want to know if the means between groups are statistically significant. We want to know if those differences are meaningful. For example, we run an ANOVA and find a significant difference between the mean three groups, A, B, and C. We then use Tukey&#8217;s HSD and find significant differences between the means of each pairwise comparison (A/B, A/C, B/C). To figure out if these differences and practically meaningful, and not just statistically significant, we want to perform an additional test such as Cohen&#8217;s d, which measures the size of the difference between two means in units of standard deviation (ie, the effect size). For example, imagine a scenario where p&lt;0.05, but Cohen&#8217;s d is small&#8212; this would indicate that the result is statistically significant, but practically insignificant or weak. On the flip side, a p&gt;0.05, but large Cohen&#8217;s d could mean that your experiment found a practically meaningful difference between groups, but it was not statistically significant, potentially indicating that the study was underpowered.</p><p>Interpreting Cohen&#8217;s d can be a little tricky, but a good rule of thumb is as follows: </p><ul><li><p>Small effect = ~|0.2|, </p></li><li><p>Small-to-medium effect = ~|&gt;0.2 - &lt;0.5|, </p></li><li><p>Medium effect = ~|0.5|, </p></li><li><p>Medium to large effect = ~|&gt;0.5 - ,0.8|,</p></li><li><p>Large effect is &gt;~|0.8|, and </p></li><li><p>Very large effect &gt;~|1.5|. </p></li></ul><p>Note that we use the |absolute| values to determine the effect size. The specific sign of the value just tells us which group is larger. For example, a positive Cohen&#8217;s d indicates the control group&#8217;s mean is lower, and a negative Cohen&#8217;s d tells us the mean of the experimental group is lower. So, for the pipeline this now gives us:</p><div class="highlighted_code_block" data-attrs="{&quot;language&quot;:&quot;plaintext&quot;,&quot;nodeId&quot;:&quot;496b3a0c-166f-41ce-a8e2-4ca3e65cafe6&quot;}" data-component-name="HighlightedCodeBlockToDOM"><pre class="shiki"><code class="language-plaintext">Comparing 2 groups?
&#9500;&#9472;&#9472; Normally distributed &#8594; t-test
&#9492;&#9472;&#9472; Not normally distributed &#8594; Wilcoxon rank-sum test

Comparing 3+ groups?
&#9500;&#9472;&#9472; Normally distributed &#8594; ANOVA
&#9474;   &#9492;&#9472;&#9472; Significant &#8594; post hoc (Tukey's HSD or pairwise t-test w/ BH)
&#9474;       &#9492;&#9472;&#9472; Effective size &#8594; Cohen's d
&#9492;&#9472;&#9472; Not normally distributed &#8594; Kruskal-Wallis H-test
    &#9492;&#9472;&#9472; Significant &#8594; Dunn's test
        &#9492;&#9472;&#9472; Effect dize &#8594; Cohen's d</code></pre></div><p>Importantly, Cohen&#8217;s benchmarks were based on behavioral sciences and do not transfer to all fields; what is &#8220;large&#8221; in psychology may be considered small in genomics, or vice versa<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. As a result, we can&#8217;t assume that a small p-value and large Cohen&#8217;s d means that something is biologically meaningful. To do that, we&#8217;d need to understand what amount of change matters in our system being studied (ideally, we should also pre-specify this ahead of time, even if it&#8217;s informally in our own lab notebook&#8212;this practice is underused in company biology and more valuable than most people appreciate). For example, even moderate differences in phospho-protein abundance have large biological repercussions, but the same can&#8217;t be said of all molecules or biological systems. Thus, this isn&#8217;t something we can bake into our code and pipeline. It&#8217;s something we need as end-users of the tools we build, which speaks to the same issues I discussed in <a href="https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone">Issue #69 // What Actually Makes Someone Good at ML in Computational Biology?</a> where I wrote:</p><blockquote><p>"In my opinion, the most important skill for a bioinformatics scientist or computational biologist isn&#8217;t their ability to code or do advanced math and statistics. It&#8217;s their ability to contextualize the outputs of their code and extract meaningful biological insights."</p></blockquote><p>This also relates to something I discussed in <a href="https://sequenceanddestroy.substack.com/p/when-perfect-code-produces-imperfect">Issue #40: When Perfect Code Produces Imperfect Science</a>, which is that coding is perhaps the simplest part of a bioinformatics scientist&#8217;s job. The real challenge is understanding the vast networks of connections&#8212;how data flows through analytical pipelines, why certain analyses or have to precede others, and which statistical assumptions underpin different methods (as we covered in this piece). All of these interdependencies make bioinformatics projects less like building a house, and more like baking a cake.</p><p>When you build a house specifications translate directly to outcomes. If you add an extra wall the finished product is just the house with an extra wall. When you bake a cake, on the other hand, small variations in technique or timing can produce dramatically different results. Add a little extra butter and the result may not just be an extra buttery cake&#8212;instead, it could be an oily blob. Similarly, when performing bioinformatics analyses small details make all the difference&#8212;choose Bonferroni correction over Benjamin Hochberg, use a t-test over a Wilcoxon sum rank test, or select a different normalization approach and your biological conclusions can transform entirely, often for the worst. These aren&#8217;t just technical choices but scientific judgments requiring deep understanding of both computational methods and biological systems.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Statistical power is the probability that a test correctly identifies a real effect when one exists. In practice, four things increase your power: larger samples, greater effect sizes, lower variance in the data, and higher significance thresholds (though, the latter is essentially moving the finishing line closer to make it easier to reach, imo). This is why the t-test is preferred over ANOVA for two-group comparisons&#8212;it channels all its power into one specific comparison rather than spreading it across a broader omnibus test.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Why multiple testing inflates type I error: Each test at &#945; = 0.05 (i.e, the threshold value or significance level used to determine is a result is statistically significant) carries a 5% false positive risk. If you run a bunch of tests those small risks compound and the chance that at least one result is a false positive grows fast. For example, given the family-wise error rate formula of <code>1&#8722;(1&#8722;&#945;)&#8319;</code>, the risk of false positives with 1, 5, and 20 tests are 5%, 23%, and 64% respectively. This is why post-hoc corrections like Tukey&#8217;s HSD exist&#8212;they adjust thresholds so the overall risk stays controlled across all comparisons, not just each one individually.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Suppose you&#8217;re comparing cell-free DNA fragment length ratios across control subjects, stage I cancer patients, and stage II/III patients cancer patients. This data is seldom normally distributed, so we may start our pipeline with a Kruskal-Wallis H-test across the three groups. Then, to determine which comparisons are significant we use a Dunn&#8217;s test to see which comparisons are driving the signal in the KW H-test. We then use a Cohen&#8217;s d to quantify the effect size. However, If we were to Cohen&#8217;s classic benchmarks in our interpretation, were may find that all of the effects are tiny. But, knowing that fragment ratios are sensitive&#8212;and differences between groups are inherently small&#8212;we may consider that even a Cohen&#8217;s d 0.4 (technically small-to-medium) can be a clinically meaningful shift in a fragmentation profile. Again, this is why it&#8217;s helpful to define your thresholds before your analysis. </p><div><hr></div><p><strong>About the Author: </strong>I&#8217;m a computational biologist and bioengineer whose work spans precision oncology, biodefense, and human performance. As a founder at NNOXX, I helped develop the first wearable sensor to measure muscle oxygenation and nitric oxide bioactivity non-invasively and in real time, taking a first-to-market technology from concept to commercialization. Since then, my work has expanded to include developing computational models to understand mechanisms of treatment resistance and identify rational drug targets in cancer, among other topics. You can read about what I&#8217;m currently learning, working on, and thinking about on <a href="https://sequenceanddestroy.substack.com">Sequence &amp; Destroy</a>, or reach me <a href="https://www.evanpeikon.com/contact">here</a>. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy is a reader-supported publication. To receive new posts, consider becoming a subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #71 // Deeper Signal]]></title><description><![CDATA[What Whoop's latest patent reveals about the future of wearable sensing]]></description><link>https://sequenceanddestroy.substack.com/p/issue-71-deeper-signal</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-71-deeper-signal</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 12 Apr 2026 15:14:14 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/29de3362-7a16-4124-b0be-388504804355_1434x846.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox. </p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2>Issue &#8470; 71 // <strong>Deeper Signal</strong></h2><p>For the better part of the last five years, I&#8217;ve been predicting that the wearable technology industry would eventually abandon its fixation with single-point sensing&#8212;the use of a single localized device to monitor physical, physiological, or environmental data from one specific location on the body. </p><p>The smart watch, wristband, and smart ring-based form factors aren&#8217;t ubiquitous in continuous health monitoring because the wrist and finger are the best places to measure human physiology. They&#8217;re ubiquitous because they&#8217;re convenient and fashion forward&#8212;the epitome of <a href="https://sequenceanddestroy.substack.com/p/the-garden-of-technological-possibilities">form-first wearable design</a>. Convenience has a way of calcifying into convention though, and for years the dominant players in the space&#8212;Whoop, Oura, Apple&#8212;showed little appetite for disrupting their own product lines. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Kq8I!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Kq8I!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 424w, https://substackcdn.com/image/fetch/$s_!Kq8I!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 848w, https://substackcdn.com/image/fetch/$s_!Kq8I!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 1272w, https://substackcdn.com/image/fetch/$s_!Kq8I!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Kq8I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png" width="728" height="169.18309859154928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:396,&quot;width&quot;:1704,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:395630,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fccefcbe7-c8e1-4e5d-8fda-b1311c7b9fc3_1704x396.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Kq8I!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 424w, https://substackcdn.com/image/fetch/$s_!Kq8I!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 848w, https://substackcdn.com/image/fetch/$s_!Kq8I!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 1272w, https://substackcdn.com/image/fetch/$s_!Kq8I!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F00365471-013e-4b15-a84c-f09edb94dd74_1704x396.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">Form-first wearables prioritize fashion-forward design. Their sleek appearance, and minimal footprint, make strong visual statements and their ubiquity stems partly from this design philosophy. Function-first wearables inhabit a parallel universe, where biomarkers specificity and accuracy dominant, and design follows from that. </figcaption></figure></div><p>Then, just a few days ago, I saw a patent by Whoop and a trademark filing by Garmin&#8212;both pointing in the same direction&#8212;that changed my read on where the industry may be heading. To understand why, it helps to have context about a biometric that most people in the consumer health space have never heard of: muscle oxygenation (SmO2). As I&#8217;ve written previously in <a href="https://sequenceanddestroy.substack.com/p/understanding-variation-in-muscle?utm_source=publication-search">Dampening the Noise: Making Sense of Variability In Biometric Measurements</a>, SmO2 is best understood as the balance of oxygen supply and demand in skeletal muscle. </p><div class="callout-block" data-callout="true"><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bkRQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bkRQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 424w, https://substackcdn.com/image/fetch/$s_!bkRQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 848w, https://substackcdn.com/image/fetch/$s_!bkRQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 1272w, https://substackcdn.com/image/fetch/$s_!bkRQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bkRQ!,w_2400,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png" width="704" height="280.713536201469" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;large&quot;,&quot;height&quot;:760,&quot;width&quot;:1906,&quot;resizeWidth&quot;:704,&quot;bytes&quot;:800645,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb55d90f-93ce-4be1-9bc3-f8c6b1ff91be_1906x760.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-large" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bkRQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 424w, https://substackcdn.com/image/fetch/$s_!bkRQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 848w, https://substackcdn.com/image/fetch/$s_!bkRQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 1272w, https://substackcdn.com/image/fetch/$s_!bkRQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb27bb93a-998d-4d0d-b44a-05763c4181ea_1906x760.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></div><p>The basic principle behind measuring muscle oxygenation is fairly simple. Near-infrared light penetrates biological tissue with relative ease, and when directed at muscle, some of those photons are absorbed by hemoglobin and myoglobin&#8212;the oxygen-carrying proteins in blood and muscle cells respectively&#8212;while the rest scatter back to a detector on the skin surface. Oxygenated and deoxygenated forms of these proteins absorb light differently across wavelengths, so by measuring how much light returns at each wavelength, you can infer the ratio of oxygenated to deoxygenated hemoglobin in the tissue below the sensor.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!04XV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!04XV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 424w, https://substackcdn.com/image/fetch/$s_!04XV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 848w, https://substackcdn.com/image/fetch/$s_!04XV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 1272w, https://substackcdn.com/image/fetch/$s_!04XV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!04XV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png" width="386" height="343.178125" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1138,&quot;width&quot;:1280,&quot;resizeWidth&quot;:386,&quot;bytes&quot;:202452,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fecea6ffe-326b-425b-8e07-c6c3224c8927_1280x1280.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!04XV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 424w, https://substackcdn.com/image/fetch/$s_!04XV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 848w, https://substackcdn.com/image/fetch/$s_!04XV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 1272w, https://substackcdn.com/image/fetch/$s_!04XV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe67b4e7c-c98b-4748-964b-31002292d358_1280x1138.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is different from what your Apple Watch, Whoop, or Oura ring is doing when it estimates blood oxygen (SpO2). Those devices use pulse oximetry&#8212;a technique that isolates the pulsatile arterial signal in the capillary beds near the skin&#8217;s surface (1-2mm deep), giving you a read on how well oxygenated your blood is. Muscle oxygenation, by contrast, is measured much deeper (~20mm), past the skin and subcutaneous fat, reaching the muscle itself. Where pulse oximetry tells you about oxygen in arterial blood, mNIRS tells you about oxygen in the tissue&#8217;s microvasculature&#8212;a meaningfully different thing. When you do a heavy set of squats, the SpO2 reading on your wrist barely moves. The SmO2 signal in your quadriceps on the other hand drops rapidly (as much as a ~75% change in oxygen saturation) as demand outpaces supply, then rebounds during rest&#8212;telling you something concrete about how your muscles are responding to load, recovering between sets, and adapting over time.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Hivp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Hivp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 424w, https://substackcdn.com/image/fetch/$s_!Hivp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 848w, https://substackcdn.com/image/fetch/$s_!Hivp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 1272w, https://substackcdn.com/image/fetch/$s_!Hivp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Hivp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png" width="636" height="221.7379518072289" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:463,&quot;width&quot;:1328,&quot;resizeWidth&quot;:636,&quot;bytes&quot;:402795,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179376121?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73706670-6826-4fa4-a502-44ca91c12040_1362x492.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Hivp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 424w, https://substackcdn.com/image/fetch/$s_!Hivp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 848w, https://substackcdn.com/image/fetch/$s_!Hivp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 1272w, https://substackcdn.com/image/fetch/$s_!Hivp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F93db5c2a-aace-47e4-8a2f-f02b8701d939_1328x463.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">A cyclist&#8217;s muscle oxygenation and acceleration&#8212;both recorded with a NNOXX devices&#8212;during three high-intensity intervals. </figcaption></figure></div><p>I first became interested in muscle oxygenation monitoring and mNIRS about twelve years ago, well before it had a meaningful foothold in consumer health. I later developed the first NSCA-certified course on training with muscle oxygenation for Moxy, and eventually co-founded NNOXX, one of the companies bringing this technology to a broader market. So when I say I&#8217;ve been watching this space closely, I mean it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YrIC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YrIC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 424w, https://substackcdn.com/image/fetch/$s_!YrIC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 848w, https://substackcdn.com/image/fetch/$s_!YrIC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 1272w, https://substackcdn.com/image/fetch/$s_!YrIC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YrIC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png" width="1456" height="452" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:452,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1443772,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YrIC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 424w, https://substackcdn.com/image/fetch/$s_!YrIC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 848w, https://substackcdn.com/image/fetch/$s_!YrIC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 1272w, https://substackcdn.com/image/fetch/$s_!YrIC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F729ec39d-fc53-4858-89ba-8bb62f686797_1906x592.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Training with muscle oxygenation / mNIRS in the days before smart-phone based UIs were available. </figcaption></figure></div><p>Over the past year, I&#8217;ve been trying to understand the trajectory of consumer awareness of this technology and how it compares to other measurements. The figure below shows Google Trends data for muscle oxygenation and heart rate variability&#8212;HRV, the wearable industry&#8217;s favorite recovery metric over the past decade&#8212;over the previous 25 years. Looking at the chart, two very different trajectories emerge. HRV&#8217;s rise&#8212;starting in ~2014&#8212;was steep and relatively sustained, carried upward by a succession of product launches from Whoop and Oura, each new device generation producing another wave of consumer interest. But look more carefully at what drove those waves, and something becomes apparent: with each successive product release, both companies seem to be extracting diminishing returns in terms of search popularity. Their hardware iterations still move the needle, but by less each time. The single biggest driver of consumer interest in wearable health monitoring over the past decade wasn&#8217;t a product launch at all&#8212;it was COVID-19, which sent search interest in biometric tracking sharply upward across the board as people became acutely aware of their own physiological states in a way they hadn&#8217;t been before.</p><div class="callout-block" data-callout="true"><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pFOf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pFOf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 424w, https://substackcdn.com/image/fetch/$s_!pFOf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 848w, https://substackcdn.com/image/fetch/$s_!pFOf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 1272w, https://substackcdn.com/image/fetch/$s_!pFOf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pFOf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png" width="1456" height="730" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:730,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:833993,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!pFOf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 424w, https://substackcdn.com/image/fetch/$s_!pFOf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 848w, https://substackcdn.com/image/fetch/$s_!pFOf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 1272w, https://substackcdn.com/image/fetch/$s_!pFOf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d80ef0f-74bf-4ef1-a35d-a6712afbf891_2842x1424.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></div><p>Against that backdrop, the muscle oxygenation trend line looks different&#8212;not a smooth climb, but a series of spikes and retreats, each one triggered by a new company entering the space. Moxy, the first company to release a consumer-focused muscle oximeter, generated an initial spike of interest that then subsided. The dynamics of emerging technology markets are counterintuitive in this way: a lone pioneer often fails to create a category, because without competitors, there&#8217;s no broader narrative for consumers or media to latch onto. Whoop and Oura grew together precisely because they were foils for one another&#8212;each company&#8217;s marketing reinforcing the other&#8217;s, collectively expanding the conversation about wearable health monitoring. Moxy had no such foil. Lacking real competitors, it also lacked the network effects, market validation, and sustained consumer awareness that competition, paradoxically, tends to generate.</p><p>It wasn&#8217;t until Humon&#8212;a VC-backed spinout from MIT&#8217;s Venture Development Center&#8212;commercialized its flagship device in 2018 that muscle oxygenation seemed poised to achieve the kind of mainstream traction that eluded Moxy. Then, two years later, Humon shut down. In their closing letter, they cited the core challenge: muscle oxygen was a new and somewhat misunderstood metric that required sustained market education to exist. Building a user base around a concept most consumers have never encountered is expensive and slow. Without the runway to outlast that educational curve, even a good product can fail. Notably though, Whoop saw the potential in what Humon had built, acquiring their assets and team in a move that looked less like a corporate merger than a targeted acquihire&#8212;tellingly, Humon&#8217;s CTO Daniel Wiese became Whoop&#8217;s Director of R&amp;D shortly after.</p><div class="callout-block" data-callout="true"><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pcvb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pcvb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 424w, https://substackcdn.com/image/fetch/$s_!pcvb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 848w, https://substackcdn.com/image/fetch/$s_!pcvb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 1272w, https://substackcdn.com/image/fetch/$s_!pcvb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pcvb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png" width="561" height="302.1385714285714" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:754,&quot;width&quot;:1400,&quot;resizeWidth&quot;:561,&quot;bytes&quot;:181010,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pcvb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 424w, https://substackcdn.com/image/fetch/$s_!pcvb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 848w, https://substackcdn.com/image/fetch/$s_!pcvb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 1272w, https://substackcdn.com/image/fetch/$s_!pcvb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8671ec0c-d247-45a3-b3d1-3a8a70acf478_1400x754.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></div><p>Following Humon&#8217;s closure, search interest in muscle oxygenation drifted back toward pre-Humon levels, reinforcing the impression that the category might simply not be ready. Then my co-founders and I announced the launch of NNOXX in 2021, after which Google Trends for muscle oxygenation reached an all-time high. Two years later when NNOXX and Train.Red&#8212;a European mNIRS company that launched around the same time&#8212;both brought their products to market in 2023, something new happened: for the first time, two well-funded, consumer-focused muscle oximeter companies were operating simultaneously, creating the competitive dynamic the category had always needed. From that point, interest in muscle oxygenation didn&#8217;t just recover&#8212;it climbed steadily and has continued to do so, now on a trajectory to overtake heart rate variability in search popularity within the next few years. Given how thoroughly HRV came to define the wearable technology conversation over the past decade, that&#8217;s an incredible thing to say about a metric that was essentially invisible to the general public not long ago.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UAc7!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UAc7!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 424w, https://substackcdn.com/image/fetch/$s_!UAc7!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 848w, https://substackcdn.com/image/fetch/$s_!UAc7!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 1272w, https://substackcdn.com/image/fetch/$s_!UAc7!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UAc7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png" width="728" height="139" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:false,&quot;imageSize&quot;:&quot;normal&quot;,&quot;height&quot;:278,&quot;width&quot;:1456,&quot;resizeWidth&quot;:728,&quot;bytes&quot;:1010205,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179376121?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F151ccd7b-285c-4d10-b09a-22a5497209da_2232x430.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:&quot;center&quot;,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!UAc7!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 424w, https://substackcdn.com/image/fetch/$s_!UAc7!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 848w, https://substackcdn.com/image/fetch/$s_!UAc7!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 1272w, https://substackcdn.com/image/fetch/$s_!UAc7!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d23022c-078f-43b5-b9e2-a779418028bd_2219x423.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">The genesis of NNOXX from concept (2020) to full-scale commercialization (2023)</figcaption></figure></div><p>Partly based on the above, my hypothesis was that Whoop and Oura would be forced toward distributed sensor architectures&#8212;multi-point sensing systems that capture physiological data from multiple locations on the body, rather than a single wrist or finger&#8212;both in order to stay competitive with one another and to fight off up and comers, which I previously wrote about in <a href="https://sequenceanddestroy.substack.com/p/distributed-by-design?utm_source=publication-search">Distributed by Design</a>. My logic was as follows: there&#8217;s only so many signals you can measure at any one anatomical site, and the first company to move beyond that constraint would gain a durable competitive advantage. But I also wrote, perhaps too confidently, that Whoop appeared to be doubling down on single-point sensing while Oura was already gesturing toward distributed architecture:</p><blockquote><p>"Companies like Oura have already started moving in this direction by itegrating data from constant glucose monitors into their platform. This transition from single to multi-point sensing represents a schism opening up between companies like Oura, which are embracing distributed network sensing, and companies like Whoop who are doubling down on their single-point sensing technologies, making progressively bolder claims about its capabilities."</p></blockquote><p>It was against this backdrop that I came across Whoop&#8217;s latest patent filing last week&#8212;listing, among its inventors, the founding team from Humon. On one hand, I&#8217;ve suspected Whoop would eventually deploy the assets they acquired from Humon in a new product rollout. On the other, I&#8217;d grown skeptical of their direction, assuming they&#8217;d continue abstracting away from their sensors&#8217; underlying capabilities in favor of surface-level differentiation as I wrote about in <a href="https://sequenceanddestroy.substack.com/p/issue-59-castles-on-quicksand">Castes on Quicksand: A Consumer Health Tracking Story</a>.  </p><p>That said, the filing itself raises more questions than it answers. The sensor architecture it describes appears to rely on placing the existing Whoop device against different muscle groups to capture mNIRS data&#8212;rather than introducing a new, purpose-built sensor. Recent speculation about a mystery device spotted on Tadej Poga&#269;ar during the 2026 Strade Bianche seems to point in a related direction&#8212;though what exactly was being measured remains unclear.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7w7e!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7w7e!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 424w, https://substackcdn.com/image/fetch/$s_!7w7e!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 848w, https://substackcdn.com/image/fetch/$s_!7w7e!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 1272w, https://substackcdn.com/image/fetch/$s_!7w7e!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7w7e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png" width="534" height="301.5529411764706" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:672,&quot;width&quot;:1190,&quot;resizeWidth&quot;:534,&quot;bytes&quot;:956025,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7w7e!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 424w, https://substackcdn.com/image/fetch/$s_!7w7e!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 848w, https://substackcdn.com/image/fetch/$s_!7w7e!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 1272w, https://substackcdn.com/image/fetch/$s_!7w7e!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c8a7947-bf3f-4fdf-920b-ed136d561c3f_1190x672.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://www.cyclingnews.com/cycling-tech-components/what-was-the-strange-black-arm-sensor-worn-by-tadej-pogacar-at-strade-bianche/">Source</a></figcaption></figure></div><blockquote><p><strong>Why does this mater?</strong> In February 2026, Whoop launched Whoop Advanced Labs in the UAE. Two months later, Team UAE Emirates&#8212;the professional cycling team Poga&#269;ar rides for and which he led to back-to-back Tour de France victories in 2024 and 2025&#8212;announced a partnership with Whoop, naming it their "Official Health and Performance Wearable." Poga&#269;ar is widely considered the GOAT of cycling, and equipment choices during races attract the kind of scrutiny in cycling circles that an Apple launch might generate in tech. A mystery sensor on his arm during Strade Bianche is not an accidental sighting. Whether it represents a Whoop 5.0 variant with enhanced optical capabilities (cleaner HRV, heart rate, and blood oxygenation readings during exercise), or something further along a development roadmap toward muscle oximetry, is the question. </p></blockquote><p>If that reading is correct, the patent may be deliberately vague about a problem they haven't solved yet. That problem is source-detector geometry. Near-infrared light needs sufficient separation between the emitter and the photodetector&#8212;typically around 30mm&#8212;to ensure that photons penetrate deep enough into tissue to interact with muscle hemoglobin rather than simply scattering off the skin surface or being absorbed by subcutaneous fat. This isn&#8217;t a software problem or something a better algorithm can compensate for. It&#8217;s also the reason that Moxy, NNOXX, Train.Red, and BSX&#8212;companies that arrived at their designs independently, in different countries, over different time periods&#8212;all ended up with eerily similar form factors. The geometry is a constraint, not an aesthetic choice. A device optimized for wrist-based photoplethysmography isn&#8217;t going to become a muscle oximeter by virtue of being strapped to a bicep or quadricep.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!yy2U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!yy2U!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 424w, https://substackcdn.com/image/fetch/$s_!yy2U!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 848w, https://substackcdn.com/image/fetch/$s_!yy2U!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 1272w, https://substackcdn.com/image/fetch/$s_!yy2U!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!yy2U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png" width="390" height="245.80645161290323" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:508,&quot;width&quot;:806,&quot;resizeWidth&quot;:390,&quot;bytes&quot;:426772,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179376121?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!yy2U!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 424w, https://substackcdn.com/image/fetch/$s_!yy2U!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 848w, https://substackcdn.com/image/fetch/$s_!yy2U!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 1272w, https://substackcdn.com/image/fetch/$s_!yy2U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0e911ca5-011e-4e5b-8af1-6cd0f40fb569_806x508.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Traditional NIRS wearable applications treat the skin, as well as any tissue within or around the region of interest, as homogenous. The model developed by NNOXX differentiates between multiple, distinct tissue layers: skin, subcutaneous adipose, skeletal muscle, and blood within the muscle and respective capillaries. (<a href="https://preprints.jmir.org/preprint/79347">Corso &amp; Peikon, 2025</a>)</figcaption></figure></div><p>If Whoop&#8217;s plan is to enable meaningful mNIRS readings from their existing hardware, they will almost certainly run into this wall. The alternative&#8212;developing a genuinely new sensor form factor, purpose-built for deep tissue optics&#8212;would represent a more significant strategic departure than anything they&#8217;ve announced publicly. What I suspect is that the patent represents an early-stage disclosure and that the actual product roadmap is still in flux. Whoop is reportedly eyeing an IPO, which creates real pressure to demonstrate a meaningful hardware differentiation story to investors. A credible move into muscle oxygenation, done properly and with the right sensor geometry, would be precisely that story. It would also be their first genuine step toward a distributed sensor architecture.</p><p>The question is what they&#8217;d actually do with a new purpose-built muscle oximeter. I have two predictions, and both stem from a gap that every serious athlete who has tried to train by wearable data has eventually run into. The first is readiness. Whoop already offers a readiness score, as does Oura and virtually every other consumer health wearable. But, these scores share a common limitation: they&#8217;re systemic. They measure how recovered your cardiovascular and autonomic nervous systems are and infer from that whether your body is prepared to perform. The problem is that systemic recovery and local muscle recovery don&#8217;t always move together&#8212;I first learned this when beta-testing an "AI coach" feature for OmegaWave in ~2013.  Your HRV can look excellent the morning after a hard lower-body session while the muscles you trained are still compromised and a wrist-worn device has no way to see that discrepancy. What <a href="https://connect.nnoxx.com/c/give-feedback/new-readiness-and-performance-assessment-now-available-in-nnoxx-s-mobile-app">NNOXX built</a>, and what Whoop could build with the Humon&#8217;s technology, is a readiness score that distinguishes between these two things&#8212;aerobic readiness on one hand, local muscle readiness on the other. The practical implication is significant: rather than asking "is my body ready?" athletes could ask "is this specific muscle group ready?"&#8212;and get an answer grounded in the tissue&#8217;s actual physiological state rather than a systemic proxy for it.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2BXq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2BXq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 424w, https://substackcdn.com/image/fetch/$s_!2BXq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 848w, https://substackcdn.com/image/fetch/$s_!2BXq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 1272w, https://substackcdn.com/image/fetch/$s_!2BXq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2BXq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png" width="552" height="325.11496062992126" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:748,&quot;width&quot;:1270,&quot;resizeWidth&quot;:552,&quot;bytes&quot;:1033041,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2BXq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 424w, https://substackcdn.com/image/fetch/$s_!2BXq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 848w, https://substackcdn.com/image/fetch/$s_!2BXq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 1272w, https://substackcdn.com/image/fetch/$s_!2BXq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61f6ff13-7171-4581-9dd6-109ec8880c96_1270x748.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The second application is resistance training, and it&#8217;s where I think the opportunity is most underappreciated. The fundamental problem with quantifying strength training using wrist-worn wearables is that the two metrics most devices rely on&#8212;heart rate and accelerometry&#8212;are poor proxies for what actually determines training quality: metabolic demand and mechanical tension at the local muscle level. Heart rate during a set of heavy front squats tells you something about cardiovascular response, but it lags behind the actual effort, saturates quickly, and doesn&#8217;t distinguish between a set that was truly taxing on the working muscles and one where large increases in intra-abdominal pressure were driving increases in heart rate, without a proportional drop in muscle oxygenation. Additionally, accelerometry can count reps, but it says nothing about the physiological cost of producing them.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!RvKH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!RvKH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 424w, https://substackcdn.com/image/fetch/$s_!RvKH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 848w, https://substackcdn.com/image/fetch/$s_!RvKH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!RvKH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!RvKH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png" width="1456" height="820" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:820,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2984002,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!RvKH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 424w, https://substackcdn.com/image/fetch/$s_!RvKH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 848w, https://substackcdn.com/image/fetch/$s_!RvKH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 1272w, https://substackcdn.com/image/fetch/$s_!RvKH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0b58e1e-ca67-4935-a0f2-e52578e7c5ed_1846x1040.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Muscle oxygenation provides an exercise-agnostic measure of exercise intensity, whether during endurance training, resistance training, or sports like climbing. </figcaption></figure></div><p>SmO2 changes this picture. During a set, you can watch muscle oxygenation drop in real time as demand outpaces supply and the steepness of that drop is a direct readout of exercise intensity at the tissue level, not a cardiovascular surrogate for it. Between sets, the reoxygenation curve tells you how quickly the muscle is recovering: a fast, complete rebound suggests the muscle is ready to work again; a slow or incomplete recovery suggests it isn&#8217;t, regardless of how long the clock says you&#8217;ve been resting. The combination of these two signals&#8212;depletion rate during effort, replenishment rate during recovery&#8212;gives you something that doesn&#8217;t currently exist in any mainstream wearable: an objective, real-time measure of  strain from resistance training.</p><div class="callout-block" data-callout="true"><p> <strong>Note:</strong> On a further reading, a lot of this patent hinges around the pressure sensitive strap. To me, there could be two uses for this. One is ensuring that pressure from the strap securing the sensor doesn&#8217;t impact muscle oxygenation readings. The other is developing an open or closed-loop feedback system for muscle oxygenation-guided BFR training. My sense is that the latter is the target, given that the Humon sensor&#8217;s readings didn&#8217;t appear to be impacted to a large degree by the pressure exerted on the leg from it&#8217;s strap (~30 mmHG).</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nJDc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nJDc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 424w, https://substackcdn.com/image/fetch/$s_!nJDc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 848w, https://substackcdn.com/image/fetch/$s_!nJDc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 1272w, https://substackcdn.com/image/fetch/$s_!nJDc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nJDc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png" width="1456" height="732" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:732,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2787401,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/193788240?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nJDc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 424w, https://substackcdn.com/image/fetch/$s_!nJDc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 848w, https://substackcdn.com/image/fetch/$s_!nJDc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 1272w, https://substackcdn.com/image/fetch/$s_!nJDc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc529d4e9-bc15-45b6-81e3-59bb79fd7458_2792x1404.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></div><p>Consider the total addressable market for this type of technology. Endurance athletes have had access to objective intensity metrics for decades, both external and internal&#8212;power meters on bikes, pace data from GPS, HR readings, blood lactate, VO2. Strength athletes have had RPE and rep counts. The gap between those two worlds is not just a data gap; it&#8217;s a coaching and training design gap. Closing it with a wearable that can actually see into the muscle would be, for strength training, what the power meter and heart rate were for cycling: a shift from subjective feel to ground truth.</p><p>Whether Whoop gets there, or whether they will be the first major player to enter this space, is uncertain. The constraints of adding an SmO2 reading to a Whoop band are real, and if that is their plan there are reasons to be skeptical. But the pieces are in place in a way they haven&#8217;t been before&#8212;the acquired technology from Humon, the recent patent filing, the IPO pressure, and a market that has spent the last two years discovering that muscle oxygenation is a metric worth caring about. What happens next will be one of the more consequential product stories in wearables in some time.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div><hr></div><p><strong>About the Author</strong></p><p>I&#8217;m a computational biologist and bioengineer whose work spans precision oncology, biodefense, and human performance. As a founder at NNOXX, I helped build the first platform to measure muscle oxygenation and nitric oxide bioactivity non-invasively and in real time, bringing a first-to-market technology from concept to commercialization. Since then, my work has focused on developing computational models to understand cancer evolution and mechanisms of treatment resistance. You can read about what I&#8217;m currently learning, working on, and thinking about on <a href="https://sequenceanddestroy.substack.com">Sequence &amp; Destroy</a>, or reach me at <a href="mailto:evanpeikon@gmail.com">evanpeikon@gmail.com</a>.</p>]]></content:encoded></item><item><title><![CDATA[Issue #70 // Why Interpretable Models Outperform Black Boxes in Biology]]></title><description><![CDATA[The Case for Mechanistic Understanding and Interpretability in an Age of Black Box ML]]></description><link>https://sequenceanddestroy.substack.com/p/issue-70-why-interpretable-models</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-70-why-interpretable-models</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 15 Mar 2026 13:16:18 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/69367d34-19cb-4a23-8154-0ac6acb55ab2_1442x848.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>I recently had an interesting conversation on the tension between accuracy and interpretability with the head of ML at a small biotech working on liquid biopsies, which grew out of two earlier pieces: <a href="https://github.com/evanpeikon/Build_A_Biomarker">How to Develop Predictive Biomarkers</a> and <a href="https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone">What Actually Makes Someone Good at ML in Computational Biology</a>. One of the more interesting facets of the discussion was on how accuracy and interpretability can act as competitive moats, but through totally different mechanisms. </p><p>Accuracy, in a sense, is a commodity that can be bought. Given enough money, you can buy your way to a better model&#8212;more high quality training data, more compute, larger teams, and greater access to proprietary datasets. This gives early movers a huge competitive advantage, but given enough resources a well-funded competitor can close almost any accuracy gap. Interpretability, on the other hand, doesn&#8217;t work that way&#8212;it can&#8217;t be purchased at scale. Instead, it requires deep domain expertise, biological intuition, and the kind of hard-won insight that you can&#8217;t list on a procurement order&#8212;it&#8217;s a bit of a dark art, and that&#8217;s what makes it defensible. </p><p>But, the more interesting question isn&#8217;t which moat is harder to breach. It&#8217;s which produces better science. In this piece, we&#8217;ll explore how interpretable models don&#8217;t just offer a strategic advantage; in biology specifically, they outperform black boxes where it matters most (i.e., clinical translation, generalization to new patient cohorts). Additionally, unlike accuracy, that advantage isn&#8217;t one that more spending can replicate. </p><div><hr></div><h2>Issue &#8470; 70 // <strong>Why Interpretable Models Outperform Black Boxes in Biology</strong></h2><p>Accuracy is a property of a model&#8217;s performance on a test set&#8212;it tells you how often the model is right, but nothing about <em>why</em>. Interpretability, on the other hand, describes a model&#8217;s relationship to the system its modeling, meaning the degree to which the model&#8217;s internal logic mirrors the underlying biology. The distinction between accuracy and interpretability can be made more clear with an example. </p><p>After building a biological knowledge graph, you can predict whether an edge exists between two nodes using a graph neural network (GNN). Alternatively, you can predict that same edge with a graph traversal, following chains of known biological relationships to their logical conclusion. The GNN might outperform the traversal on a test set, but the traversal tells you <em>how</em> those nodes are connected and <em>why</em> that connection is plausible. One approach gives you an answer while the other gives you a hypothesis. This same tension pops up when trying to predict treatment response. For example, a deep learning model trained on reverse phase protein array data may predict an HR-/HER2- breast cancer patient&#8217;s response to immunotherapy with high accuracy. A 5-gene signature, by contrast, will likely perform worse on held-out data, but is has the advantage of telling you which biological processes are doing the work and it gives a clinician tangible information to act on. </p><p>In most domains, we accept the trade-off between accuracy and interpretability without much deliberation. A bit of interpretability is worth giving up if the model generalizes reasonably well and ships on time. But, in biology there&#8217;s a third consideration that changes the calculus entirely: the model needs to suggest the next experiment.</p><p>This is why the standard machine learning framework&#8212;train, validate, test, deploy&#8212;doesn&#8217;t map cleanly onto biological research. In a recommendation system, deployment is the goal. In biology, deployment is the beginning of the interesting part. A classifier that predicts tumor response with 95% accuracy but offers no mechanistic hypothesis about why certain tumors respond is a dead end from a scientific standpoint. An interpretable model that achieves 88% accuracy but identifies a specific regulatory relationship&#8212;say, that tumors with elevated hypoxia signatures and low immune infiltration are the ones failing to respond&#8212;gives you a lever to pull. It tells you what to measure in the next cohort, what to interrogate in the next cell line experiment, and what pathway to target in the next therapeutic design.</p><p>George Church touched on this in a recent interview with the Lifespan Research Institute, which I tend to agree with:</p><blockquote><p style="text-align: justify;"><strong>Would you prefer a weaker but more interpretable AI or a stronger but less interpretable one?</strong></p><p style="text-align: justify;">GC: I lean on the interpretability side. It&#8217;s not an either-or, but&#8230; we&#8217;re in science. Few engineers are willing to just pull a rabbit out of a hat, just a black box. Scientists and engineers, by and large, want to know the mechanism. The FDA likes to know mechanisms. Typically, the autocatalytic loop where you learn something and then you invent something is better if it&#8217;s mechanistically grounded. So, I lean pretty heavily in the direction of interpretability, explainability, transparency, et cetera, and also it&#8217;s safer.</p><p style="text-align: justify;"><strong>I just honestly think that we will soon be faced with this dilemma, where we will have to choose between the power of the model to do things and its actual interpretability, but maybe we&#8217;re not there yet.</strong></p><p style="text-align: justify;">GC: If you look at the human scientist experience, the most powerful sciences are the ones that are better articulated mechanistically on a solid foundation rather than black boxes. The black boxes tend to include artifacts, dead ends. Most of the progress in science and engineering has been part of community efforts with strong mechanistic underpinnings.</p></blockquote><p>There&#8217;s another, subtler, version of this argument that goes beyond experimental utility. Interpretable models that align with biological mechanisms are more likely to generalize across contexts, because they&#8217;ve learned something about the causal structure of the system rather than correlations in a particular dataset. A gene expression signature that works as a classifier in one cohort may fail in another if the cohort-specific technical and demographic variables are confounded with the biology. For example, if your training cohort happens to be predominantly collected at a single institution, on a particular sequencing platform, with a skewed demographic composition, the model will quietly bake all of that in. Then, when it&#8217;s applied to a new cohort with different technical and demographic variables, it often fail (and more frustratingly, it fails without telling you why).</p><p>A model built around a mechanistically interpretable pathway is more robust to this kind of distribution shift because the mechanism travels with the biology, not with the dataset. For example, a hypoxia signature exists in a tumor regardless of where the sample was collected or how it was sequenced. Anchoring your model to that kind of structure gives it something real to hold onto across contexts. This isn&#8217;t always true, of course. Mechanisms can be context dependent as previously discussed in<a href="https://sequenceanddestroy.substack.com/p/molecular-moonlighting?utm_source=publication-search"> Issue #55 // Molecular Moonlighting</a>. But, even when interpretable models fail, they tend to fail informatively. If your 5-gene signature stops being predictive in a new cohort, that&#8217;s a hypothesis: perhaps hypoxia isn&#8217;t the rate-limiting factor in this population, or perhaps there&#8217;s a moderating variable you haven&#8217;t accounted for. A black box achieving 0.51 AUC on held-out data gives you nothing to work with. The failure mode of an interpretable model is, in a sense, still science. </p><div><hr></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><p></p>]]></content:encoded></item><item><title><![CDATA[Issue #69 // What Actually Makes Someone Good at ML in Computational Biology?]]></title><description><![CDATA[What separates good practitioners from great ones isn&#8217;t the code they write&#8212;it&#8217;s how they interrogate the output and what they bring to that interrogation that no automated pipeline can replicate.]]></description><link>https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Fri, 27 Feb 2026 11:02:46 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/88b6e531-3eb0-4020-a661-f8632ba4bea4_1382x812.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>Earlier this week I shared a new tutorial, <strong><a href="https://github.com/evanpeikon/Build_A_Biomarker/tree/main">How to Develop Predictive Biomarkers</a></strong>, which is designed to fill a conceptual gap&#8212;something I&#8217;ve identified when searching for resources that help computational biologists with intermediate machine learning experience leverage their unique skillset to create biomarker panels for predicting clinical outcomes (like treatment response in triple-negative breast cancer, for example). </p><p>Most existing resources cover isolated parts of ML model development, but none that I&#8217;ve come across explain the big picture&#8212;how the different parts of the pipeline fit together. This tutorial aims to do that, guiding you through the thinking process at each step from data acquisition to model deployment. Check it out, and please pass it along if you find it useful! </p>
      <p>
          <a href="https://sequenceanddestroy.substack.com/p/issue-69-what-actually-makes-someone">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[Issue #68 // Connectivity Is the Target]]></title><description><![CDATA[Network Topology and Drug Discovery in Cancer]]></description><link>https://sequenceanddestroy.substack.com/p/issue-68-connectivity-is-the-target</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-68-connectivity-is-the-target</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 22 Feb 2026 11:02:06 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/66cc9cc5-857a-4053-bd15-d6bc6e983b9a_1384x816.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2><strong>Issue </strong>&#8470; 68 // Connectivity Is the Target</h2><p>In the early 2000s, Barab&#225;si and colleagues introduced the idea of using network science to identify essential, highly connected protein "hubs" in molecular interaction networks as prime therapeutic targets. The core idea here is that by targeting, and disrupting, these central nodes (which often reside within disease-specific network modules) drugs can collapse pathological signaling while minimizing off target/adverse effects elsewhere.  </p><p>Conceptually, idea makes a lot of sense to me. But, in practice it&#8217;s surprisingly difficult to prove that a given hub is actually a good drug target rather than just a topologically interesting node. This piece is about a framework I&#8217;ve been playing around with to do exactly that. I&#8217;m writing this partly to clarify my own thinking, and partly to invite feedback from others working on similar problems.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3><strong>What Makes a Good Drug Target?</strong></h3><p>In <a href="https://sequenceanddestroy.substack.com/p/issue-58-how-to-kill-a-tumor">Issue #58: How to Kill a Tumor</a>, I wrote about different schools of thought on drug target identification in cancer. These include finding proteins that are differentially expressed on tumor cell surfaces relative to healthy tissue (the ADC approach), targeting molecular drivers of disease (HER2-directed therapy, AKT inhibitors, and so on), and disrupting network topology, which is the angle Barab&#225;si&#8217;s work opens up. In an ideal scenario, all three converge on the same protein, resulting in total pathway collapse.</p><p>For the first two approaches, the evidentiary chain is relatively clean and easy to understand. Let&#8217;s take the case of identifying an ADC (antibody drug conjugate) target, for example. To do this you&#8217;d first want to find proteins that are up-regulated in tumors relative to healthy tissue, ensuring that (a) the cytotoxic payload hits the right target and (b) that off-target effects are limited. Next, you&#8217;d want to confirm your candidate targets are associated with treatment non-response by seeing if they are further elevated in patients who fail standard-of-care treatment relative to those who response favorable (here, we can define "respond" as achieving pathological complete response, or pCR, for example). Finally, we&#8217;d want to show that high expression of the candidate protein predicts disease recurrence in the treatment-refractory population via Kaplan-Meier and Cox proportional hazards analysis. There&#8217;s a legible chain of cause and effect here, and the logic maps directly onto clinical intuition&#8212;the protein is elevated in tumors, associated with non-response, and patients with high expression of it have worse survival. No one will batt an eye if you use this logic to propose a new drug target. </p><p>The network hub case is harder. Many of the standard intuitions about what makes a good drug target don&#8217;t cleanly transfer, and even defining what it means for a protein to be a hub is non-trivial. Degree centrality, betweenness centrality, and closeness centrality all capture different aspects of topological importance, and there&#8217;s no consensus on which metric is most clinically relevant.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3><strong>Defining "Core" Hubs</strong></h3><p>Recently, I&#8217;ve been developing the concept of "core hubs," which I define as proteins satisfying two criteria simultaneously<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>&#8230;</p><ol><li><p>First, core hubs must be rich-club members, which are proteins whose degree exceeds the threshold <em>k</em> where the normalized rich-club coefficient (&#966;_norm) reaches its maximum value in the protein co-expression network. The rich-club phenomenon describes the tendency of nodes of similar degrees to be more densely interconnected with each other than would be expected by chance, forming a kind of elite backbone of the network. </p></li><li><p>Second, core hubs must also rank in the top 10% of all proteins by degree centrality. </p></li></ol><blockquote><p><strong>An example:</strong> The figure below shows the same protein co-expression network visualized with two complimentary color schemes. The left version is color-coded by pathway membership with nodes sized by their degree centrality. The right version is color-coded by each node&#8217;s normalized rich-club coefficient (&#966;_norm) at their respective degree, with color intensity scaled to the maximum &#966;_norm.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GS7K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GS7K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 424w, https://substackcdn.com/image/fetch/$s_!GS7K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 848w, https://substackcdn.com/image/fetch/$s_!GS7K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 1272w, https://substackcdn.com/image/fetch/$s_!GS7K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GS7K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png" width="473" height="269.41100323624596" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:704,&quot;width&quot;:1236,&quot;resizeWidth&quot;:473,&quot;bytes&quot;:781561,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/188650532?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GS7K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 424w, https://substackcdn.com/image/fetch/$s_!GS7K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 848w, https://substackcdn.com/image/fetch/$s_!GS7K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 1272w, https://substackcdn.com/image/fetch/$s_!GS7K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff5dec11f-8d48-4047-9877-78b4fb2d6823_1236x704.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The node delineated as A is categorized as a core hub, whereas nodes B and C are not. From the visual, you can se that node B has a very high degree centrality (it&#8217;s the largest node in the network), but it is not a rich club member (hence, the white color in the rightmost visual). Node C on the other hand is a rich-club member, but it is not highly connected. Only node A, in this comparison, fits both criteria being both highly connected (top 10% of nodes by degree centrally) and a rich-club member.</p></blockquote><p>Conceptually, core hub proteins represent the network's most influential nodes, forming a highly connected, mutually reinforcing, central component that acts as a backbone for efficient global communication. Based on these properties, i&#8217;ve hypothesized that disrupting these nodes should cause disproportionate, cascading network failures relative to equivalent disruption of non-hub proteins, making them prime candidates for targeted intervention.</p><p>One early observation that gave me some confidence in this framework is how it maps onto established biology. For example, I&#8217;ve seen that HER2+ breast cancer cohorts have HER2 family proteins appearing as core hubs in their co-expression networks, which is consistent with the known efficacy of HER2-targeted therapies in these populations and suggests that part of what those therapies are doing is producing network-level collapse rather than simply reducing the abundance of a single target. The framework has also surfaced less expected results: HER2 family core hubs appearing in a subset of TNBC patient networks, and certain ADC target candidates appearing as core hubs in molecular subtypes where you might not anticipate it.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3><strong>The Validation Problem</strong></h3><p>The challenge is proving these targets out. Kaplan-Meier curves are less informative when a candidate protein shows limited expression variance across a cohort&#8212;which is often the case for network hubs, since their importance is topological rather than purely expression-level. What does become compelling is when a given protein appears as a core hub in a specific molecular subtype (say, TNBC), and then when that population is stratified by treatment response, the hub is specific to the non-responder network. That&#8217;s a meaningful signal. But it still doesn&#8217;t tell you whether disrupting that hub would actually collapse the network in a therapeutically relevant way.</p><p>To address this, I&#8217;ve been developing a targeted network attack simulation that works as follows&#8230;</p><ul><li><p>Given a candidate protein A, I simulate a targeted attack by removing that protein and all proteins with co-expression relationships exceeding a defined threshold (|r| &gt; 0.7, for example) from the network. </p></li><li><p>I then quantify network disruption across four metrics: change in network density, change in the size of the largest connected component, change in average local clustering coefficient, and change in the number of connected components. These metrics collectively capture both global connectivity loss and the degree to which the network fragments into isolated subgraphs.</p></li><li><p>To contextualize the disruption caused by targeting protein A, I run upwards of 1,000 random attack simulations on the same network, each time selecting proteins at random and removing them along with their high-correlation neighbors. The disruption scores from the targeted attack on protein A are then expressed as a percentile relative to this empirical random attack distribution. </p></li></ul><p>Using the above framework, a core hub should score near the top of this distribution, indicating that its removal causes significantly more network disruption than would be expected from removing a randomly selected protein of equivalent size&#8212;evidence that its topological position, not just its connectivity, is what makes it influential.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2cp_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2cp_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 424w, https://substackcdn.com/image/fetch/$s_!2cp_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 848w, https://substackcdn.com/image/fetch/$s_!2cp_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 1272w, https://substackcdn.com/image/fetch/$s_!2cp_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2cp_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png" width="1848" height="622" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:622,&quot;width&quot;:1848,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:104129,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/188650532?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2b080983-06c5-43a3-a011-7d3e72116309_1848x622.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2cp_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 424w, https://substackcdn.com/image/fetch/$s_!2cp_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 848w, https://substackcdn.com/image/fetch/$s_!2cp_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 1272w, https://substackcdn.com/image/fetch/$s_!2cp_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d982264-2aff-4cc1-9ee0-7e0d7bb4b35a_1848x622.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Outputs from a targeted attack simulation (protein data on the x-axis is redacted). You can see that the core hub candidate protein (red) falls in the 97th percentile of disruption relative to random attacks on the network. </figcaption></figure></div><p>To further validate the target we can then perform a parallel comparison. For example, if the targeted attack is performed on the TNBC non-responder network where the core hub is present, I perform the same attack on the TNBC responder network where the core hub is absent. The prediction is that attacking the hub in non-responders produces near-maximal network disruption, while the same attack in responders produces disruption near the 50th percentile&#8212;indistinguishable from random. That divergence would constitute meaningful evidence that the hub&#8217;s topological importance is specific to the biological context in which it&#8217;s relevant.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!HGjr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!HGjr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 424w, https://substackcdn.com/image/fetch/$s_!HGjr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 848w, https://substackcdn.com/image/fetch/$s_!HGjr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 1272w, https://substackcdn.com/image/fetch/$s_!HGjr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!HGjr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png" width="372" height="311.2849740932642" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:646,&quot;width&quot;:772,&quot;resizeWidth&quot;:372,&quot;bytes&quot;:82448,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/188650532?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffc229dc3-2ed4-49d9-b704-d1e76a9328eb_772x646.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!HGjr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 424w, https://substackcdn.com/image/fetch/$s_!HGjr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 848w, https://substackcdn.com/image/fetch/$s_!HGjr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 1272w, https://substackcdn.com/image/fetch/$s_!HGjr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F432c54df-cd30-46a8-ac08-1eebffa0dd1a_772x646.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">An example output from three simulated attacks on MammPrint H1 vs MammaPrint H2 Triple negative breast cancer networks. You can see that attacking these H2-specific core hubs results in maximal disruption in the H2 networks, but not in the comparator H1 networks where these same three candidates fail to achieve score hub status. </figcaption></figure></div><p>One of the most obvious limitations of this approach that I see is that it currently operates on coexpression networks, which capture statistical co-variation between proteins but are agnostic to mechanism. In practice, this could limit some of the causal interpretability of the attack simulations, though I do still think this framework can still meaningfully complement more traditional target validation analyses<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. </p><p>Additionally, over the next few weeks/months, the extension I&#8217;m most interested in is replacing the co-expression networks with causal protein networks, like those generated by <a href="https://github.com/evanpeikon/CausalEdge">CausalEdge</a>. A causal network would allow the attack simulations to operate on directional, mechanistically grounded relationships rather than symmetric correlations, which would substantially strengthen the interpretability of the results. The constraint for now is data: inferring reliable causal networks requires longitudinal proteomic measurements that I don&#8217;t currently have at the necessary scale. But it&#8217;s a natural next step, and one worth keeping in mind as the framework develops.</p><div><hr></div><p><strong>Liked this piece?</strong> If so, tap the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I&#8217;m not sure if this particular set of criteria has been explored elsewhere (perhaps, under a different name?). if you&#8217;ve seen something similar, let me know!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Working with reverse phase protein array (RPPA) data makes this more actionable than it would be with transcriptomic data, for reasons I&#8217;ll address in a future piece, but the short version is that RPPA measures protein and phosphoprotein abundance directly, making the co-expression relationships closer to functional reality than mRNA-derived networks would be.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #67 // Doping with Data]]></title><description><![CDATA[Issue #66 // Toward A Multi-Dimensional Model of Exercise Intensity]]></description><link>https://sequenceanddestroy.substack.com/p/issue-66-doping-with-data</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-66-doping-with-data</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 15 Feb 2026 12:00:35 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/1a885417-ab28-4656-8aa0-bbc6469eba44_1254x856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>This issue of <strong>Sequence &amp; Destroy</strong> will be a bit different from the norm. Rather than writing about proteomics, molecular medicine, or any number of other topics I&#8217;ve been <a href="https://sequenceanddestroy.substack.com/archive">exploring as of late</a>, this piece is about a new project I&#8217;ve been working on. One that lives at the intersections of human performance, wearable technology, and machine learning. If you&#8217;re interested in learning more, shoot me an email at evanpeikon@gmail.com. </p><div><hr></div><h2><strong>Issue </strong>&#8470; 67<strong> // Doping with Data </strong></h2><p>Every endurance athlete, coach, and human performance scientist is familiar with training zones&#8212;they&#8217;re ubiquitous across all work capacity-based sports, whether running, cycling, skiing, or rowing. In practice, these zones serve as a common language for coaches and athletes to prescribe, execute, and communicate training&#8225;.</p><blockquote><p>&#8225;Conceptually, training zones are intensity classifications that divide exercise into discrete bands, usually based on heart rate, power, or blood lactate. Additionally, each zone is presumed to elicit a distinct physiological response and training adaptation, and as a result training zones are commonly used to prescribe training (i.e., run for 60-minutes in a zone-2 heart rate between X and Y bpm). </p></blockquote><p>Zone-based training models have become so embedded in coaching culture that questioning them feels almost heretical. But, it&#8217;s my opinion that the standard 5-zone training model is based on two flawed assumptions:</p><ol><li><p>The first is that human physiological can be adequately described by a single biomarker (for example, heart rate). </p></li><li><p>The second is that exercise intensity can be uniformly characterized by five (or any hard-coded number, really) distinct  physiologic states. </p></li></ol><p>Neither of these assumptions holds up to serious examination and the compounding effect of these two over simplifications&#8212;zones based on a single biomarker combined with arbitrary zone counts&#8212;leads to a conceptual framework that often obscures more than it reveals. <strong>I think we can do much better</strong>. Let&#8217;s dive in! </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>Problem 1&#8212;The Single Biomarker Fallacy </h3><p>Zone-based training typically relies on one biomarker&#8212;most commonly heart rate. When you prescribe training based on a single biomarker, you&#8217;re by grouping data points alone a line, which forces you to make arbitrary cutoffs. For example, between 109 and 127 beats per minute (bpm) is zone 2, above 165 bpm is zone 5, and so forth. The idea is simple and easily communicated (and hence widely adopted), yet incomplete. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ADXy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ADXy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ADXy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ADXy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ADXy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ADXy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg" width="579" height="201.23027718550105" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:326,&quot;width&quot;:938,&quot;resizeWidth&quot;:579,&quot;bytes&quot;:47480,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa3fc3380-d5d5-4144-8d02-9ebe3ff22b8c_960x344.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ADXy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ADXy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ADXy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ADXy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faadcdd53-cde8-48e5-aa82-ae3c818768d5_938x326.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">A standard A standard five-zone intensity model categorized by maximum heart rate percentages.</figcaption></figure></div><p>Physiology doesn&#8217;t respect these boundaries because physiology isn&#8217;t one-dimensional<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. To better understand what I mean, let&#8217;s take the following scenario: a competitive Nordic skier completes the same 15-minute performance assessment (5-minute ramp, 10-minutes slightly above critical speed) on a ski-treadmill twice, with sessions separated by ~10 weeks. The first time they complete the test, their heart rate climbs from ~140 to ~175 over the first 5-minutes, then they maintain that heart rate (roughly) for the remaining 10-minutes. However, during those 10-minutes their respiratory rate and VO2 steadily climb, reaching near peak values by the end, as depicted below. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!buNX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!buNX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 424w, https://substackcdn.com/image/fetch/$s_!buNX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 848w, https://substackcdn.com/image/fetch/$s_!buNX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 1272w, https://substackcdn.com/image/fetch/$s_!buNX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!buNX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png" width="642" height="461.5141329258976" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:941,&quot;width&quot;:1309,&quot;resizeWidth&quot;:642,&quot;bytes&quot;:220716,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F219bb8de-f6f4-49ab-8858-6a9ecf0c02c2_1734x985.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!buNX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 424w, https://substackcdn.com/image/fetch/$s_!buNX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 848w, https://substackcdn.com/image/fetch/$s_!buNX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 1272w, https://substackcdn.com/image/fetch/$s_!buNX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff29ce385-6d6e-45e8-8451-b0a3b53e74dc_1309x941.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">An xc-skiers heart rate, respiratory rate, and VO2 during two performance tests separated by ~10 weeks. </figcaption></figure></div><p>Now, let&#8217;s look at the second assessment after ~10 weeks of limitation-based training. We can see that the athlete&#8217;s heart rate kinetics are roughly the same through the duration of the test, but their respiratory rate and VO2 follow a very different trend. Whereas they progressively increased throughout the first assessment, they initially increase, then flatline in the second. Two tests with the <strong>same heart rate</strong>, but profoundly<strong> different overall responses</strong>&#8212;yet, we only see this discontinuity when moving from 1&#8594;3 dimensions. </p><p>The above case highlights how defining zones based on a single biomarker can result in two different physiologic states being lumped together as one (<strong>false positive</strong>). However, defining zones with discrete heart rate thresholds is just as likely to produce <strong>false negatives</strong>. For example, according to the Polar app, cycling at 133 bpm puts me in zone 2, but at 134 bpm I&#8217;m suddenly in zone 3, despite there being negligible physiological differences between the two measurements (and, as mentioned previously, this approach treats all exercise at 134 bpm as being equivalent, regardless of what&#8217;s happening to my ventilatory drive, substrate utilization, or muscle oxygenation)&#8225;. </p><blockquote><p>&#8225;This problem actually compounds during training. You might complete a hard twenty-minute threshold effort where your heart rate stays locked between 168-172 bpm. A conventional five-zone model may classify the entire block as Zone 4. Perfect execution, according to your training app. But your subjective experience may tell a different story. There&#8217;s the moment when the effort first becomes hard, the period where it&#8217;s sustainably hard, and the final minutes where everything starts to unravel. Most athletes can feel this gradation. <strong>Traditional single-metric zoning flattens these experiences into a single category</strong>.</p></blockquote><p>Multi-dimensional zoning allows us to circumvent these issues. With two biomarkers, you can place datapoints on an x-y plane, allowing you to identify regions characterized by high value in one dimension and low in another, for example.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6c6a!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6c6a!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 424w, https://substackcdn.com/image/fetch/$s_!6c6a!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 848w, https://substackcdn.com/image/fetch/$s_!6c6a!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 1272w, https://substackcdn.com/image/fetch/$s_!6c6a!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6c6a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png" width="2048" height="868" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:868,&quot;width&quot;:2048,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:424339,&quot;alt&quot;:null,&quot;title&quot;:&quot;Screenshot 2026-02-09 at 1.03.21&#8239;PM.png&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="Screenshot 2026-02-09 at 1.03.21&#8239;PM.png" srcset="https://substackcdn.com/image/fetch/$s_!6c6a!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 424w, https://substackcdn.com/image/fetch/$s_!6c6a!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 848w, https://substackcdn.com/image/fetch/$s_!6c6a!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 1272w, https://substackcdn.com/image/fetch/$s_!6c6a!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F829de7ca-0721-402a-aefd-6098d89d0fce_2048x868.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Two-dimensional zoning along an x-y (heart rate-RR) plane. </figcaption></figure></div><p>With three biomarkers, data points can be spaced along x-y-z axes, providing even more granularity (computationally, we can work in much higher dimensional spaces, though it&#8217;s not possible to visualize anything greater than three). </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!em-d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!em-d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 424w, https://substackcdn.com/image/fetch/$s_!em-d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 848w, https://substackcdn.com/image/fetch/$s_!em-d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 1272w, https://substackcdn.com/image/fetch/$s_!em-d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!em-d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png" width="2032" height="786" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:786,&quot;width&quot;:2032,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:546052,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe00af902-f48d-4c2f-9f78-41d4ed360ec4_2032x786.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!em-d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 424w, https://substackcdn.com/image/fetch/$s_!em-d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 848w, https://substackcdn.com/image/fetch/$s_!em-d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 1272w, https://substackcdn.com/image/fetch/$s_!em-d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39c49a4b-6582-486e-a7ed-07484d66dcf1_2032x786.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Three-dimensional zoning alone an x-y-z axis (HR-VO2-RR).</figcaption></figure></div><p>The model my colleagues Frank Kolar, Daniel Crumback, and I have been developing uses a minimum of 3 and maximum of 18 dimensions based on the data a user provides us with us. In the best case scenario, the end-user of our platform uploads heart rate, muscle oxygenation, VO2, respiratory frequency, fat oxidation, and carbohydrate oxidation data. From those six features, we then calculate first and second derivatives, allowing us to capture both absolute rates and their rates of change (we also perform additional feature engineering to reduce measurement noise and prevent any single metric from dominating the analysis due to scale differences)&#8225;.</p><blockquote><p>&#8225;Calculating these derivates allows us to answer questions like "How quickly is heart rate climbing (dHR/dt)? And is muscle oxygen saturation declining, and if so, is that decline accelerating (d&#178;SmO2/dt&#178;)?" During a ramp test or progressive effort, these temporal dynamics reveal metabolic transitions that static measurements miss. For example, an athlete whose heart rate is rising by 2 bpm/minute while SmO2 remains stable is in a different physiological state than one whose heart rate is rising at the same rate while SmO2 is plummeting.</p></blockquote><p>This approach reveals patterns invisible to traditional zoning. For example, let&#8217;s say we have two cyclists with identical max power and max heart rate values who are both completing a ramp test to exhaustion. At 250 watts and 158 bpm, traditional single-metric zoning would classify them identically&#8212;same power, same heart rate, same zone. Multi-dimensional analysis might reveal that Athlete A has SmO2 of 58%, respiratory rate of 28 breaths/min, and RER of 0.89, while Athlete B has SmO2 of 44%, respiratory rate of 38 breaths/min, and RER of 0.97. Despite having the same heart rate and power values, they are not in equivalent physiological states. As a result, they will not receive equivalent training adaptations, and therefore should not be assigned to the same zone.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>Problem 2&#8212;Arbitrary Zone Counts</h3><p>For arguments sake, let&#8217;s say we accepted single biomarker zoning as adequate. Even then, the second problem still remains: why five zones? The answer, historically, is "because five seems like a reasonable number." Not too few to be useless and not too many to be unwieldy. </p><p>But, as many experienced coaches have learned, different athletes have different numbers of distinct physiological states, and that number often correlates with both training status and metabolic flexibility (<span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Marco Altini&quot;,&quot;id&quot;:37314582,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!WksT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06f6ffc6-abe2-4b01-8a7b-b256fab924eb_1584x1584.jpeg&quot;,&quot;uuid&quot;:&quot;eb02a852-a5d7-4e26-9747-7c2f4cf22c95&quot;}" data-component-name="MentionToDOM"></span> has a nice post that touches on this topic, which you can find <a href="https://marcoaltini.substack.com/p/training-intensity-distribution">here</a>). For example, a recreational runner three months into their first structured training program may only have two or three "gears". Forcing this athlete&#8217;s physiology into five zones means at least two of them are arbitrary subdivisions with little underlying physiological justification. Conversely, a grand tour caliber cyclist with extraordinary metabolic flexibility developed over years of systematic training might have eight or night district physiological states. Compressing this into five zones obliterates meaningful distinctions&#8212;lumping together intensity bands that produce different training adaptations and have different sustainability characteristics.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rncM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rncM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 424w, https://substackcdn.com/image/fetch/$s_!rncM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 848w, https://substackcdn.com/image/fetch/$s_!rncM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 1272w, https://substackcdn.com/image/fetch/$s_!rncM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rncM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png" width="612" height="505.5219512195122" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1016,&quot;width&quot;:1230,&quot;resizeWidth&quot;:612,&quot;bytes&quot;:464903,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rncM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 424w, https://substackcdn.com/image/fetch/$s_!rncM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 848w, https://substackcdn.com/image/fetch/$s_!rncM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 1272w, https://substackcdn.com/image/fetch/$s_!rncM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe7724220-5fde-4eb3-87a9-e65f12289023_1230x1016.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As you can see from these examples, the arbitrary five-zone model creates two problems simultaneously: it over-partitions beginners (assigning five zones where fewer exist) and under-partitions elites (assigning five zones where many more may exist). The result is a system that&#8217;s simultaneously too coarse for advanced athletes and too granular for beginners, while being perfectly adequate for few. <strong>It&#8217;s a one-size-fits-all solution to what should be individualized</strong>. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>Revealing Hidden Structure&#8212;What Optimal Multi-Dimensional Zoning Shows</h3><p>The interaction between these two problems&#8212;single biomarker measurements and arbitrary zone counts&#8212;compounds their individual limitations. Consider a forced five-zone model applied to heart rate data alone. We&#8217;re taking a one-dimensional signal and arbitrarily dividing it into five categories. The zones might have some relationship to underlying physiology (higher heart rates generally indicate higher metabolic demand), but the boundaries are arbitrary percentages of maximum or threshold heart rate.</p><p>This is the problem Frank, Daniel Crumback, and I have been working on solving with our new project <a href="https://assess.works">Assess.Works</a>. Rather than abandoning training zones as a concept, we want the framework to evolve&#8212;from discrete to continuous, from single to multi-dimensional, and most importantly, from arbitrary to data-driven. The core principle is as follows: let unsupervised machine learning identify the natural number of distinct physiological states in an athlete&#8217;s data, rather than imposing a predetermined structure, and then characterize each state using whatever multi-dimensional physiological data is available. In practice, here&#8217;s what the process looks like: first, an athlete uploads their assessment data (typically from a ramp test or progressive effort going from easy to maximal intensity). </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cXDC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cXDC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 424w, https://substackcdn.com/image/fetch/$s_!cXDC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 848w, https://substackcdn.com/image/fetch/$s_!cXDC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 1272w, https://substackcdn.com/image/fetch/$s_!cXDC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cXDC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png" width="2674" height="1034" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a679232-2565-41b3-88bb-43203834c444_2674x1034.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1034,&quot;width&quot;:2674,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:134047,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F458cd254-798d-46a8-80f7-7618d08a3d14_2674x1034.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cXDC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 424w, https://substackcdn.com/image/fetch/$s_!cXDC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 848w, https://substackcdn.com/image/fetch/$s_!cXDC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 1272w, https://substackcdn.com/image/fetch/$s_!cXDC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a679232-2565-41b3-88bb-43203834c444_2674x1034.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Then, our system searches for all available physiologic metrics, then loads them for easy charting and analysis. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!H_sh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!H_sh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 424w, https://substackcdn.com/image/fetch/$s_!H_sh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 848w, https://substackcdn.com/image/fetch/$s_!H_sh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 1272w, https://substackcdn.com/image/fetch/$s_!H_sh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!H_sh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png" width="1654" height="538" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:538,&quot;width&quot;:1654,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:306600,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd19e8d73-f280-4a2d-bbb4-7f62ce9f562f_1654x538.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!H_sh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 424w, https://substackcdn.com/image/fetch/$s_!H_sh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 848w, https://substackcdn.com/image/fetch/$s_!H_sh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 1272w, https://substackcdn.com/image/fetch/$s_!H_sh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7d6895ce-455d-4bf5-a9be-96e484f585e1_1654x538.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Following that, the system selects certain biomarkers compatible with our zone prediction (whenever are available) model such as heart rate, muscle oxygenation, VO2, and respiratory frequency, substrate oxidation rates. It then engineers features, applies smoothing and standardization, then runs two parallel clustering algorithms:</p><ul><li><p><strong>Forced five-cluster model</strong>: Constrains the data into exactly five zones for comparison with traditional frameworks and communication with coaches trained in conventional systems</p></li><li><p><strong>Optimal clustering model</strong>: Uses unsupervised machine learning to determine the natural number of distinct physiological states present in the data, typically ranging from two to nine clusters depending on athlete development.</p></li></ul><p>Clusters are then ordinally ranked, using an in-house developed algorithm, to ensure logical progression from low to high intensity. Below is an example session where our model identified 10 distinct states during a ramp incremental exercise test including a recovery state (I often use the terms "state", "intensity cluster", and "zone" interchangeably. For all intents and purposes, I mean the same thing in each of these cases). </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6ddq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6ddq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 424w, https://substackcdn.com/image/fetch/$s_!6ddq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 848w, https://substackcdn.com/image/fetch/$s_!6ddq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 1272w, https://substackcdn.com/image/fetch/$s_!6ddq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6ddq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png" width="1280" height="448" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:448,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:135477,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6ddq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 424w, https://substackcdn.com/image/fetch/$s_!6ddq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 848w, https://substackcdn.com/image/fetch/$s_!6ddq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 1272w, https://substackcdn.com/image/fetch/$s_!6ddq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec844a8-97f3-4030-a575-cc71c81c6293_1280x448.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">A cyclists muscle oxygenation data with each data point color coded based on cluster inclusion.</figcaption></figure></div><p>Looking at the chart above, you&#8217;ll quickly notice that adjacent intensity clusters overlapping ranges for individual metrics (ie., SmO2 data points with the same value can be color-coded by many different states depending on where in the workout they occur). This effect is even more pronounced in the figure below, which shows individual biomarkers data distribution per intensity cluster/zone (in this visual, I&#8217;ve forced the data back into a five-cluster model as described previously). </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Tm1w!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Tm1w!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 424w, https://substackcdn.com/image/fetch/$s_!Tm1w!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 848w, https://substackcdn.com/image/fetch/$s_!Tm1w!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 1272w, https://substackcdn.com/image/fetch/$s_!Tm1w!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Tm1w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png" width="1456" height="729" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:729,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:637378,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Tm1w!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 424w, https://substackcdn.com/image/fetch/$s_!Tm1w!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 848w, https://substackcdn.com/image/fetch/$s_!Tm1w!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 1272w, https://substackcdn.com/image/fetch/$s_!Tm1w!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F37e67b1a-0b95-467e-b3cc-3ee86cc3c03b_1594x798.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This is precisely where multi-dimensional analysis proves essential&#8212;and where single-biomarker zoning fails. For example, you can have the same heart rate (say, 145 bpm) assigned to multiple different clusters (recovery, and clusters 1-4 in this case). This overlap isn&#8217;t a flaw. It&#8217;s the system correctly recognizing that physiology is multivariate and that no single metric fully captures metabolic state. While individual metrics may overlap between clusters, their multi-dimensional combinations remain distinct&#8225;. </p><blockquote><p>&#8225;When you examine mean values across clusters, you see the expected progression: mean heart rate and VO2 increase from one zone to the next, while SmO2 typically decreases. But the distributional overlap in any single metric reflects the reality that physiology is complex and multi-factorial.</p></blockquote><p>This is why both problems&#8212;single-metric measurement and arbitrary zone counts&#8212;need to be solved simultaneously. Multi-dimensional analysis without optimal zone counting still imposes artificial structure. Optimal zone counting based on a single metric still misses critical physiological distinctions. The solution requires addressing both.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h3>Interpretable Machine Learning&#8212;Combining Data-Driven Patterns with Objective Physiologic Metrics </h3><p>One of the more interesting implications of data-driven intensity clustering is how it can be combined with objective physiologic thresholds, such as FATmax, critical power, or the metabolic crossover point, among others. After performing multi-dimensional zoning, you can overlay these boundaries where they exist.  You might find, for instance, that FATmax corresponds with a cluster transition. Or you might discover it doesn&#8217;t&#8212;that FATmax occurs in the middle of a stable cluster, but a zone transition happens when FATmax is crossed <em>and</em> muscle oxygen saturation simultaneously drops below a threshold. This reveals that FATmax alone doesn&#8217;t drive a broader multi-dimensional state change in this particular athlete&#8217;s physiology.</p><p><strong>This is interpretable machine learning</strong>. Rather than creating a black box that pronounces mysterious judgments about training zones, we&#8217;re building a system that shows you where objective physiological events occur and lets you identify which combinations of metrics signal meaningful transitions.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!lsKM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!lsKM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 424w, https://substackcdn.com/image/fetch/$s_!lsKM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 848w, https://substackcdn.com/image/fetch/$s_!lsKM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 1272w, https://substackcdn.com/image/fetch/$s_!lsKM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!lsKM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png" width="356" height="283.6067039106145" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:713,&quot;width&quot;:895,&quot;resizeWidth&quot;:356,&quot;bytes&quot;:239440,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F798368e2-2b63-4dc5-8965-e9d5450bc240_904x1194.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!lsKM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 424w, https://substackcdn.com/image/fetch/$s_!lsKM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 848w, https://substackcdn.com/image/fetch/$s_!lsKM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 1272w, https://substackcdn.com/image/fetch/$s_!lsKM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f81a2e-9d78-47e6-9927-58597addfefb_895x713.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Point-by-point classification provides another means by which you can probe the underling physiology behind predictions. </strong>In contrast to interval-based zoning, Assess.Works uses a continuous zoning model with high resolution. Rather than assigning minutes 10-20 were Zone 4, for example each data point receives its own cluster assignment based on the instantaneous multi-dimensional physiological state. This reveals gradual drift during nominally constant-intensity efforts&#8212;the progressive metabolic stress that athletes subjectively experience but traditional zoning fails to capture. Each cluster can be interrogated: Why is this data point assigned to Cluster 3? Because it exhibits high heart rate combined with suppressed SmO2 and elevated respiratory frequency acceleration&#8212;a specific, interpretable multi-dimensional signature.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EaZ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png" width="48" height="44.44444444444444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:100,&quot;width&quot;:108,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:7779,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179371339?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!EaZ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 424w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 848w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1272w, https://substackcdn.com/image/fetch/$s_!EaZ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9f7070ae-7a5b-4e7b-b9b1-d6ccf1f77d36_108x100.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h2>A New Direction for Human Performance </h2><p>Traditional zoning models have guided effective training for decades, and they&#8217;ll continue to do so. The five-zone model is a useful heuristic, especially for athletes and coaches who don&#8217;t have access to sophisticated physiological monitoring or who prefer simplicity over precision. But, as wearable sensors become more capable and less expensive, as athletes routinely collect data on muscle oxygen saturation, respiratory metrics, and metabolic markers during training, we have an opportunity to move beyond both arbitrary assumptions that limit traditional zoning:</p><ul><li><p><strong>From single-biomarker to multi-dimensional</strong>: Recognize that physiological state can&#8217;t be adequately captured by heart rate or power alone, but requires consideration of multiple interacting variables<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p></li><li><p><strong>From arbitrary five to optimal N</strong>: Acknowledge that different athletes operate across different numbers of distinct metabolic states, and let individual physiology dictate zone structure rather than imposing predetermined categories</p></li></ul><p>The shift is subtle but important. Instead of asking &#8220;what zone am I in?&#8221; based on a single metric and a lookup table, we can ask &#8220;what is my multi-dimensional physiological state?&#8221; and receive an answer grounded in the patterns present in our own data, informed by objective metabolic markers, and interpretable in terms of the specific combination of factors that characterize each state.</p><p>What emerges is a view of training intensity as a continuous, high-dimensional landscape rather than a series of discrete rooms. You don&#8217;t jump from Zone 3 to Zone 4 at some magic heart rate threshold. You transition gradually through a space defined by multiple interacting variables, and the &#8220;zones&#8221; are simply regions of that space where certain combinations of variables tend to cluster together naturally. Sometimes those regions align with traditional five-zone boundaries. Often they don&#8217;t. And that&#8217;s not a failure of the model&#8212;it&#8217;s the model working as intended, revealing structure in the data that simpler frameworks necessarily miss.</p><p>For now, most athletes and coaches will continue using traditional zones because they&#8217;re familiar, because they work well enough, and because the infrastructure of training plans and coaching wisdom is built around them. But as we instrument athletes more completely, as we collect richer data and develop more sophisticated analytical tools, the five-zone single-metric model will increasingly look like what it is: a useful approximation of a phenomenon too complex to fit neatly into five categories defined by one number. <strong>The question isn&#8217;t whether we should abandon zones entirely. It&#8217;s whether we&#8217;re ready to let them evolve.</strong> </p><div><hr></div><p>The team at Assess.Works just finished a major milestone, and are now moving onto the next one: auto-identifying physiological limitations. If you&#8217;re interested in learning more or  know organizations that might benefit from a conversation about our technology and strategic partnership opportunities, we&#8217;d welcome the connection. Feel free to respond to this email, or to reach out directly (<a href="mailto:evanpeikon@gmail.com">evanpeikon@gmail.com</a>).</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>The problem isn&#8217;t that zone-based training models compress physiology from a very high dimensional space to a low dimensional space. It&#8217;s that zone-based training models apply compression arbitrarily rather than in a data-driven manner (a data-driven approach would mean that instead of uniformly using a 3, 5, or 7 zone model, each athlete has an individualized number of zones based on their actual data). For more on this topic, check out<a href="https://sequenceanddestroy.substack.com/p/issue-60-compression-as-understanding"> Issue #60 // Compression As Understanding</a>. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>I&#8217;ve written about this topic previously in <a href="https://sequenceanddestroy.substack.com/p/distributed-by-design?utm_source=publication-search">Issue #48: Distributed by Design</a>, which discusses the idea of distributed sensor networks for comprehensive physiological analyses. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #66 // Crossing the Proteomics Chasm ]]></title><description><![CDATA[Transitioning from Transcriptomic to Proteomic Data Analysis]]></description><link>https://sequenceanddestroy.substack.com/p/issue-66-crossing-the-proteomics</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-66-crossing-the-proteomics</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Wed, 11 Feb 2026 15:43:48 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/32b56b60-7a9e-4636-a698-6218c5027026_1470x1158.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Enjoy this piece? </strong>You can subscribe to have the next post delivered to your inbox:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2>Issue &#8470; 66 // Crossing the Proteomics Chasm </h2><p>I recently read an article titled "Why Proteomics is Still Stuck in the Basement" on the Talus Bio blog, and in it  <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Alex Federation&quot;,&quot;id&quot;:41862680,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/32fa4dd0-4dcc-481c-b79d-fcc3df42bb22_3342x3342.jpeg&quot;,&quot;uuid&quot;:&quot;354b5b00-9b80-46b1-8253-f2a51e038864&quot;}" data-component-name="MentionToDOM"></span> and <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Lindsay Pino&quot;,&quot;id&quot;:97211381,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad2c1e6d-65b5-47b3-8523-04b196b7b36e_3335x3335.jpeg&quot;,&quot;uuid&quot;:&quot;06219c30-4f25-424e-a70f-45c33656fdd2&quot;}" data-component-name="MentionToDOM"></span> said something that stuck with me:</p><blockquote><p>"My grad school cohort mostly consisted of genomicists and computational biologists, with us proteomicists frequently the outliers, so despite DIA-MS being a Big Deal in my mass spec circles, my departmental research reports mostly got glazed-over eyes and scattered polite claps. Amongst my fellow trainees, the joke was that proteins weren&#8217;t real &#8211; they&#8217;re "gene products" and therefore just a consequence of genomics &#8211; and to an extent, it&#8217;s true. Proteomics has definitely lagged behind genomics, for a variety of reasons." [<a href="https://blog.talus.bio/p/why-proteomics-is-still-stuck-in">source</a>]</p></blockquote><p>Prior to starting lab rotations early in my PhD, almost all of my prior bioinformatics experience was working with bulk, single-cell, and spatial RNA-sequencing data. But, having now spent what&#8212;with some back of the napkin math&#8212;must have amounted to &gt;1,000 hours doing applied proteomics research, I&#8217;ve noticed a subtle shift in my thinking. When I was primarily working with RNA/transcriptional data, proteins were relegated to "gene products", as Alex and Lindsay alluded to, and were almost an afterthought<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Now, I find myself thinking more along the lines of "a gene is a protein&#8217;s way of making another protein." </p><p>The biggest challenge going from working with transcriptomic to proteomic data for me wasn&#8217;t this mental frameshift though. It was learning the nuances of working with a new type of omics data and understanding that the simplest conventions&#8212;from how to normalize data to specific thresholds to use during differential expression analysis&#8212;can be totally different<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. This is particularly challenging in proteomics, where so much knowledge is platform-specific; as a result there is less standardization of protocols and best practices aren&#8217;t as well codified. Because of this lack of centralized information, becoming proficient in working with proteomics data is a bit like a dark art, handed down from mentor to trainee through spoken word. This can be particularly true, when starting to work with more niche types of proteomic data, like that from reverse protein arrays. </p><p>Most people reading this are probably familiar with RNA-sequencing, which measures mRNA transcript abundance. Reverse phase protein array (RPPA) data on the other hand is less common, but at the simplest level it measures proteins and post-translational modifications, allowing you to assess function and active signaling status (for this reason, RPPA proteomics is often called functional proteomics). Aside from measuring a different layer of biological abstraction, one of the biggest differences between RNA-seq and RPPA data is that in the former case you&#8217;re measuring thousands (even tens of thousands) of genes at a time, whereas in the case of RPPA you&#8217;re typically measuring a few hundred proteins at most (often as little at ~150). Additionally, whereas RNA-seq provides a broad, genome-wide, "snapshot" of transcription, RPPA offers a targeted, high-sensitivity, measurement of specific pre-specified proteins.</p><blockquote><p><strong>Why use RPPA over mass spectrometry-based proteomics?</strong> For my work studying tumor phenotypes and drug vulnerabilities, RPPA offers several  advantages: it provides highly quantitative measurements with excellent reproducibility across large sample cohorts (often hundreds or thousands of samples), has superior sensitivity for detecting phosphorylation events (which are critical for understanding active signaling states), and most importantly, allows for hypothesis-driven interrogation of specific signaling pathways I'm most interested in. While mass spec offers unbiased, proteome-wide coverage, RPPA's targeted approach means I can deeply profile the signaling networks most relevant to cancer biology across many more samples than would be practical with mass spec (in some cases we do have access to both RPPA and mass-spec proteomics data, or mRNA data, but these tend to be less common).</p></blockquote><p>In addition to RPPA measuring a more focused set of proteins, we also need to consider how protein kinetics differ from mRNA. For example, when performing differential expression analysis with RNA-seq data we may be used to seeing a much higher dynamic range of values, and as a result it&#8217;s common to use stricter log2FC filters. As compared to genes, protein levels are often more stable, and have smaller, more consistent, changes in the range of ~20-50%, and as a result lax filters like |log2FC|&#8776;0.26-0.58 can provide meaningful results.</p><p>Where things became more of a challenge for me is when I wanted to start working with RPPA protein co-expression networks, both because there was no existing software available for this, and because so many assumptions from working with gene expression data don&#8217;t hold. For example, when working with RNA-seq data, there are conventional guidelines for performing weighted gene coexpression network analysis, such as selecting an appropriate soft thresholding power to achieve scale-free topology. In WGCNA, you typically test a range of soft threshold powers (often 1-20) and select the lowest power where the scale-free topology fit index (R&#178;) exceeds 0.8 or 0.9, indicating the network approximates a scale-free structure. This soft thresholding approach preserves the continuous nature of gene-gene relationships and emphasizes strong connections while maintaining weaker ones.</p><p>RPPA networks require a different approach. Because RPPA measures a pre-selected panel of functionally related proteins&#8212;typically key signaling nodes and their phosphorylation states&#8212;the network has inherently different topological properties than genome-wide transcriptomic networks. Rather than aiming for scale-free topology, I often use hard thresholding based on correlation strength to identify the most robust, coordinated signaling relationships. This approach is well-suited for RPPA data because&#8230;</p><ol><li><p>With only ~175-250 proteins, we&#8217;re specifically looking for strong, direct signaling relationships rather than the broader, more diffuse co-expression modules typical of transcriptomics; </p></li><li><p>These pre-selected proteins are chosen because they&#8217;re known signaling hubs, so we expect&#8212;and want to capture&#8212;tight coordination between functionally related proteins; and </p></li><li><p>The resulting networks are more interpretable for identifying coordinated signaling architecture, such as kinase cascades or feedback loops, rather than loosely connected modules. The density of these networks reflects the biological reality: these proteins were selected precisely because they work together in known pathways. </p></li></ol><p>Below you&#8217;ll find a sample output from a co-expression analysis software I built: here, each node is a phosphoprotein and edges between them are coordinated relationships. Each node is sized based on it&#8217;s degree centrality, and proteins are color-coded based on their pathway.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FmQi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FmQi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 424w, https://substackcdn.com/image/fetch/$s_!FmQi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 848w, https://substackcdn.com/image/fetch/$s_!FmQi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 1272w, https://substackcdn.com/image/fetch/$s_!FmQi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FmQi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png" width="533" height="239.16666666666666" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:490,&quot;width&quot;:1092,&quot;resizeWidth&quot;:533,&quot;bytes&quot;:323073,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/186499391?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!FmQi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 424w, https://substackcdn.com/image/fetch/$s_!FmQi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 848w, https://substackcdn.com/image/fetch/$s_!FmQi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 1272w, https://substackcdn.com/image/fetch/$s_!FmQi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbdbff80e-8ef7-4550-af60-737236a3dd26_1092x490.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Because I&#8217;m typically working with the same ~175-250 phospho-proteins, it&#8217;s also allowed me to create some cool tools like a Figma pathway map, part of which is depicted below, with all of the proteins and their interactions included. This resource has been extremely valuable when trying to interpret analysis results, as I can cross reference to see if any specific "pathway stories" emerge. For example, I may see that AKT T308, 4E-BP1 S65, and eIF4E S209 are up-regulated. At a quick glance this may not seem as meaningful as seeing PI3K, AKT, and mTOR all up-regulated, but using the pathway map it quickly becomes clear that there is a signal here: phosphorylation at T308 activates AKT, and 4E-BP1 is a downstream effector whose phosphorylation leads to the release, and activation, of eIF4E.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E5Fi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E5Fi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 424w, https://substackcdn.com/image/fetch/$s_!E5Fi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 848w, https://substackcdn.com/image/fetch/$s_!E5Fi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 1272w, https://substackcdn.com/image/fetch/$s_!E5Fi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E5Fi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png" width="1456" height="744" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:744,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:273895,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/186605780?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E5Fi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 424w, https://substackcdn.com/image/fetch/$s_!E5Fi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 848w, https://substackcdn.com/image/fetch/$s_!E5Fi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 1272w, https://substackcdn.com/image/fetch/$s_!E5Fi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F759ad3f8-3355-4a56-b5c5-75f91bb7f2ae_1796x918.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Speaking of pathways, this raises another point. When working with RPPA data, traditional pathway enrichment tools fail to perform. Not only do we not have enough proteins for significant enrichment, but because of the specific antibody names and phospho-sites, they are also unrecognized in most databases. To solve this, I&#8217;ve had to create a massive dictionary with each protein, the pathway(s) it belongs to, and whether its phosphorylation is functionally activating or inhibiting. Using this, I&#8217;ve created my own custom enrichment tool that looks at both the protein&#8217;s log2FC and regulatory relationship to interpret the data (for example, a down-regulated inhibitor is functionally activating). This allows for some neat visualizations, like the one below, which combines differential expression and network topology data to create a composite score (y-axis) for a proteins influence in a given condition, and under the x-axis we have color indicators for each proteins pathway membership and functional effect.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b1Bz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b1Bz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 424w, https://substackcdn.com/image/fetch/$s_!b1Bz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 848w, https://substackcdn.com/image/fetch/$s_!b1Bz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 1272w, https://substackcdn.com/image/fetch/$s_!b1Bz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b1Bz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png" width="446" height="323.79391100702577" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:620,&quot;width&quot;:854,&quot;resizeWidth&quot;:446,&quot;bytes&quot;:143607,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/186499391?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33cfca3d-0c93-4ff2-a3d8-bf0fec55dde2_854x620.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!b1Bz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 424w, https://substackcdn.com/image/fetch/$s_!b1Bz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 848w, https://substackcdn.com/image/fetch/$s_!b1Bz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 1272w, https://substackcdn.com/image/fetch/$s_!b1Bz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc9fc1e42-4b24-4e7c-b06d-20b735693a6a_854x620.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I plan to share some of these tools publicly after finishing my PhD as part of an open-source software package for working with RPPA data. In the meantime, I&#8217;m curious to hear from others making similar transitions between omics platforms: What conventions have you established? What unexpected challenges did you encounter? The proteomics community is smaller and more fragmented than genomics, but perhaps that&#8217;s precisely why sharing our solutions and learning from each other&#8217;s experiences is so valuable.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Even when I was working with RNA-sequencing data I knew that it was ultimately the proteins these genes encoded that I cared about. I also knew that mRNA levels are poor predictors of protein abundance, which I previously wrote about in <a href="https://sequenceanddestroy.substack.com/p/issue-64-lossy-compression-what-rna">Issue #64</a>. However, when all you have is gene expression data, you can start to form a gene-centric view of things. For some experimental questions this is okay, but it&#8217;s important to consider the degree of uncertainty we&#8217;re left with when the goal is to understand a tissues phenotype. In these cases, proteins can&#8217;t just be written off as "gene products". Now that I&#8217;ve spent a considerable amount of time working with proteomics data, I&#8217;m much less comfortable with the types of assumptions I used to make, and that I now recognize others making. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>For example, the fold-change thresholds I routinely applied to bulk RNA-sequencing data (often |log2FC| &gt; 1 or even &gt; 2) would miss biologically meaningful phospho protein changes entirely. Similarly, the normalization strategies that work well for count-based RNA-seq data don&#8217;t translate to the continuous, antibody-based measurements from reverse phase protein arrays.  </p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #65 // Multiplex Molecular Biosensing ]]></title><description><![CDATA[Thoughts On Continuous Protein Biosensors]]></description><link>https://sequenceanddestroy.substack.com/p/issue-61-multiplex-molecular-biosensing</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-61-multiplex-molecular-biosensing</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 01 Feb 2026 13:15:40 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/f4bdfe63-5d43-44d7-9aa9-ddf40459d94b_1450x846.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Liked this piece?</strong> If so, you&#8217;ll want to check out other articles in Sequence &amp; Destroy&#8217;s <strong><a href="https://sequenceanddestroy.substack.com/t/wearable-technology">Wearable Technology &amp; Biometrics</a></strong> collection. Additionally, you can show your support by tapping the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. Thanks!</p><div><hr></div><h2><strong>Issue </strong>&#8470; 65<strong> // Multiplex M</strong>o<strong>lecular Bi</strong>o<strong>sensing </strong></h2><p>I&#8217;ve been involved in the wearable technology industry for more than a decade now, first as a technical research consultant and scientific advisor to early-stage companies in this space and later as a co-founder at NNOXX. During this time I&#8217;ve seen two parallel trends unfold: the first is a massive cost-reduction for existing sensing modalities, and the second is a significant increase in the number of biomarker measurements that can be obtained with a wearable device. </p><p>I still remember working with the original Omegawave system&#8212;a cumbersome, expensive contraption that felt like a relic from the Soviet cosmonaut training program (in a way, it was). Omegawave was the first commercially available device to measure HRV and DC potential with the explicit goal of predicting "readiness" and was a precursor to Whoop and Oura. Back then, wearable devices were limited to a small number of measurements like heart rate, HRV, blood oxygenation, and skin temperature&#8212;vital signs that any 20th century physician would recognize. Today, we live in a different world entirely with muscle oximeters, portable metabolic analyzers, in ear EEGs, hydration sensors, and more being made available all the time. </p><p>After all of this progress, we&#8217;re approaching a conceptual precipice. The industry is now experiencing what looks to be a natural continuation of these trends with molecular biosensors capable of measuring specific proteins and metabolites in blood and interstitial fluid continuously. Companies are already developing devices that can measure real-time blood lactate, monitor ketone levels throughout the day, and track inflammatory cytokines like IL-6 at regular intervals. Unsurprisingly, VCs have taken notice, funding startups that promise to monitor various proteins with minimally invasive sensors continuously and in real-time. The implicit assumption seems to be that if measuring one molecule&#8212;glucose&#8212;revolutionized diabetes management, then surely measuring other new molecules will unlock similar insights for a wide range of applications.</p><p>But what if this assumption is wrong? What if molecular biosensors aren&#8217;t a natural progression from our current wearable technologies&#8212;what if they represent a category shift that the industry as a whole doesn&#8217;t seem to recognize? Continuous protein measurements and other molecular biomarkers aren&#8217;t just higher-resolution versions of gross physiological metrics like heart rate or blood oxygenation (collectively termed 'digital biomarkers'). They are fundamentally a different kind of measurement that requires a different interpretive framework. What follows are three connected arguments: </p><ul><li><p>First, molecular biomarkers operate according to different principles than the digital biomarkers we&#8217;re accustomed to tracking; </p></li><li><p>Second, current business incentives are driving development toward single-analyte sensors that may not deliver their promised value; and </p></li><li><p>Third, alternative paths&#8212;as opposed to those currently being pursued&#8212;might better match the technology to contexts where its capabilities are most needed.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Digital and molecular biomarkers are complementary, but they&#8217;re fundamentally different types of measurements that each require distinct interpretive frameworks. The jump from measuring heart rate to measuring individual proteins isn&#8217;t incremental progress&#8212;it&#8217;s moving from integrated physiological outputs to individual molecular components. More precisely, it&#8217;s crossing layers of biological abstraction&#8212;from emergent system-level phenomena down to the molecular entities that underlie them. From my experience working with both digital and molecular biomarkers independently, as well as developing methods to integrate them via knowledge-graph-based systems, it&#8217;s clear that the industry is attempting to apply a digital biomarker framework to molecular measurements&#8212;and that approach is built on a flawed assumption about how proteins work.</p><p>Consider what happens when you measure Interleukin-6 (IL-6) continuously. IL-6 is commonly described as a pro-inflammatory cytokine, one of the key mediators of fever and the acute phase response. It would seem reasonable to interpret elevated IL-6 as indicating more inflammation. But the biology is more complex than this. IL-6 can signal through two different pathways: classical signaling via membrane-bound receptors tends toward regenerative and anti-inflammatory effects, while trans-signaling through soluble receptors drives chronic inflammation. The same molecule at the same concentration can have entirely different biological meanings depending on context, which I previously wrote about  in <a href="https://sequenceanddestroy.substack.com/p/molecular-moonlighting">Issue #55: Molecular Moonlighting</a>. </p><p>Interleukin-6 isn&#8217;t unique in this regard. All proteins exist in dynamic networks where function emerges from context, relationships, and timing. A kinase that phosphorylates one substrate during cell cycle progression might phosphorylate a different substrate during stress response, creating entirely different downstream effects. The protein hasn&#8217;t changed&#8212;its relational context has. This has implications for how we should think about molecular biosensors. The quantified self movement has largely embraced single-molecule monitoring: track your cortisol for stress, measure your C-reactive protein for inflammation, monitor your glucose for metabolic health. But, proteins are components within networks, not standalone indicators. Measuring a single protein is like measuring the voltage across one resistor in a complex circuit&#8212;the measurement may be accurate, but its meaning depends on the state of the rest of the system.</p><p>This is where molecular biosensors differ from the gross physiological metrics we&#8217;re used to tracking. Heart rate and blood oxygenation are already emergent phenomena that integrate signals from multiple systems. They represent the output of countless molecular interactions, abstracted up to a level where single measurements carry interpretable meaning. A single protein measurement, by contrast, is looking at one node in a network where meaning comes from relationships rather than absolute values.</p><p>Glucose monitoring deserves special consideration here, since it&#8217;s the standard reference point for continuous molecular measurement. Glucose occupies an unusual middle ground between gross physiological metrics and true molecular biomarkers. While glucose is technically a molecule, its behavior in the bloodstream functions more like a systems-level measurement. Glucose concentration reflects the integrated output of multiple regulatory processes&#8212;insulin signaling, hepatic glucose production, peripheral uptake, hormonal counter-regulation&#8212;operating within relatively stable kinetic parameters. The interpretation of a glucose value doesn&#8217;t depend heavily on knowing the concentration of specific enzymes or transporters at that moment; the meaning is largely contained in the glucose value itself, contextualized by simple factors like food intake and activity. This is precisely what makes it tractable as a single-analyte measurement.</p><p>Most proteins don&#8217;t work this way. Their function depends on post-translational modifications, binding partners, subcellular localization, and the expression levels of their targets and regulators&#8212;information that isn&#8217;t captured in a concentration measurement. Glucose monitoring succeeded not because all single-molecule measurements are inherently useful, but because glucose happens to be one of the few molecules whose concentration alone carries robust meaning across most contexts. The practical implication here is that single-analyte molecular biosensors will have limited utility until we can measure enough proteins simultaneously to capture meaningful biological context. The technical achievement of measuring proteins like IL-6 in interstitial fluid is genuinely impressive, but the value proposition may be more limited than current market enthusiasm suggests. At the same time, the longer-term potential once multiplexing becomes feasible may be under-appreciated.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Building a multiplex molecular biosensor requires substantial capital, sophisticated engineering, and regulatory pathways that are still being established (and rapidly changing at the time of this writing), creating an interesting challenge for technology development. For a small biotech company, developing a single-analyte sensor is the more tractable path. You can focus on one molecule, establish a clear use case, validate the technology, navigate regulatory approval, and potentially become an acquisition target for established players like Dexcom or Abbott, among others. This pathway is already clear, and it makes business sense. Major continuous monitoring companies have strong incentives to expand into molecular sensing. But there&#8217;s a potential issue with this trajectory. When these technologies get integrated into existing platforms, they&#8217;re likely to be presented as independent metrics&#8212;just another number on the dashboard&#8212;rather than as nodes in a network that require contextual interpretation. This reflects the current business model of these companies, which centers on providing users with actionable numbers they can understand and respond to.</p><p>The alternative would be to develop multiplex sensing from the outset, but this requires investors who understand why single-analyte approaches might not deliver the anticipated value and who are willing to fund a longer, more capital-intensive development path toward a market that doesn&#8217;t yet exist. This is a harder pitch in the current early 2026 funding environment.</p><p>That said, there may be alternative entry points worth considering. Critical care medicine already involves multiplex measurements through continuous monitoring combined with frequent blood tests. Clinicians in intensive care settings are accustomed to thinking about complex physiological states rather than individual metrics. A multiplex interstitial fluid sensor that helps predict complications like sepsis before vital signs change would have clear clinical utility without requiring a shift in how practitioners interpret biological data. Once proven in this context, the technology could potentially scale to consumer applications. This inverts the typical medical device development trajectory, but it might better match the technology to contexts where its value is most apparent.</p><p>One concern about the incremental single-analyte path is that design decisions made early on can constrain future development&#8212;a topic I discussed in <a href="https://sequenceanddestroy.substack.com/p/the-garden-of-technological-possibilities">Issue #48: The Garden of Technological Possibilities</a>. Sensor architecture, regulatory strategy, manufacturing processes, and user interfaces optimized for single measurements may not easily extend to multiplexing. Adding measurements later isn&#8217;t just an engineering challenge; it may require rebuilding substantial portions of the technology stack and rethinking how information is presented to users.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>When multiplex molecular biosensors do become practical, they&#8217;ll enable some genuinely new capabilities. Rather than reporting that a single biomarker is elevated, they could describe an individual&#8217;s network state and how that relates to physiological outcomes for that specific person. We could generate personalized protein expression networks, identify which regulatory hubs are most important in an individual&#8217;s biology, and potentially detect transitions between physiological states before conventional metrics change. This represents a form of precision medicine that&#8217;s complementary to genomics&#8212;these networks are dynamic and change with disease progression, treatment, and recovery.</p><p>We&#8217;re not at that point yet. The current development trajectory shows many parallels to early genomics, where there was an assumption that identifying components would quickly translate to understanding systems. The Human Genome Project, for all its triumph, didn&#8217;t immediately yield the therapeutic revolution that many expected&#8212;because knowing the parts list doesn&#8217;t tell you how the machine works. We&#8217;re building increasingly sophisticated tools to measure individual proteins while grappling with the question of how much information a single measurement actually provides when biological function emerges from molecular relationships.</p><p>The question facing the field is not whether we&#8217;ll eventually need multiplex approaches&#8212;the biology makes that clear. The question is whether we&#8217;ll recognize this early enough to avoid spending years and considerable resources developing single-analyte sensors that provide limited actionable information. From where I sit, the current trajectory looks misguided rather than suboptimal. The industry is building toward a capability that glucose monitoring seems to validate, but glucose is the exception that proves the rule. Most proteins require network context for meaningful interpretation, and no amount of sophisticated algorithm development or machine learning will extract signal that isn&#8217;t there in the first place.</p><p>Investors and companies have a choice: continue down the incremental path of single-analyte sensors because it fits established business models and funding timelines, or acknowledge that we&#8217;re working at a different layer of biological abstraction that demands different technological and interpretive approaches from the outset. The technical achievements in molecular sensing are real and impressive. The question is whether we&#8217;re building the right things, or just the things that are easiest to build and sell with our current conceptual frameworks.</p><div><hr></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox.</p><div data-component-name="FragmentNodeToDOM"><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><p></p></div>]]></content:encoded></item><item><title><![CDATA[Issue #64 // Lossy Compression: What RNA-seq Doesn't Tell You About Cellular State]]></title><description><![CDATA[An information-theoretic argument for functional proteomics in research and clinical practice]]></description><link>https://sequenceanddestroy.substack.com/p/issue-64-lossy-compression-what-rna</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-64-lossy-compression-what-rna</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Tue, 27 Jan 2026 13:20:56 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/ec216aeb-2a80-4fb9-8311-989e83e8a33c_1440x846.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Liked this piece?</strong> If so, tap the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. Thanks so much!</p><div><hr></div><h2>Issue &#8470; 64 // Lossy Compression: What RNA-seq Doesn&#8217;t Tell You About Cellular State</h2><p>In Manolis Kellis&#8217; course <a href="https://ocw.mit.edu/ans7870/6/6.047/f15/MIT6_047F15_Compiled.pdf">Computational Biology: Genomes, Networks, Evolution</a>, he states, "<em>The most intuitive way to investigate a certain phenotype is to measure the expression levels of functional proteins present at a given time in the cell. However, measuring the concentration of proteins can be di&#64259;cult, due to their varying locations, modifications, and contexts in which they are found, as well as due to the incompleteness of the proteome. mRNA expression levels, however, are easier to measure, and are often a good approximation. By measuring the mRNA, we analyze regulation at the transcription level, without the added complications of translational regulation and active protein degradation, which simplifies the analysis at the cost of losing information.</em>" This notion is easy to lose sight of when doing RNA-seq analysis. But consider what it means to lose information in the Shannon information theoretic sense.</p><p>In Claude Shannon&#8217;s framework, information is about uncertainty reduction. Each measurement collapses the space of possible states a system could occupy. By stopping at the transcriptional level, we deliberately remain uncertain about an entire regulatory layer. The post-transcriptional processes like alternative splicing, protein modifications, localization, and degradation represent additional sources of information that specify the actual functional state of the cell. The mutual information between mRNA and protein levels is often low to moderate, with correlations around 0.4-0.7, as depicted in the images below, meaning substantial uncertainty remains about the actual effector molecules (as a reference, a correlation of 0.5 between mRNA and protein means ~75% of the variance in protein levels remains unexplained by transcriptomics)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Some proteins persist for days while their mRNAs turn over in minutes; others are rapidly degraded despite abundant transcripts. Additionally, post-translational modifications can completely alter function without any transcriptional change.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!51bw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!51bw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 424w, https://substackcdn.com/image/fetch/$s_!51bw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 848w, https://substackcdn.com/image/fetch/$s_!51bw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 1272w, https://substackcdn.com/image/fetch/$s_!51bw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!51bw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png" width="1456" height="574" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:574,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:994880,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/180792332?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!51bw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 424w, https://substackcdn.com/image/fetch/$s_!51bw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 848w, https://substackcdn.com/image/fetch/$s_!51bw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 1272w, https://substackcdn.com/image/fetch/$s_!51bw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F38d90d72-6141-4c44-a770-17c192f637b8_1924x758.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Left, de Sousa et al., 2009; Global signatures of protein and mRNA expression levels; Right, Vogel &amp; Marcotta, 2012; Insights into the regulation of protein abundance from proteomic and transcriptomic analyses.</figcaption></figure></div><p>This is where functional proteomics comes in, both as a means of getting closer to a tissue&#8217;s phenotype and as a systematic uncertainty reduction tool. Consider HER2-amplified breast cancer. Fluorescence In Situ Hybridization (FISH) can identify gene amplification, giving us a binary answer that reduces uncertainty about genomic state but leaves open whether HER2 drives the tumor&#8217;s biology or predicts therapeutic response. Transcriptomics then answers whether HER2 mRNA is elevated, collapsing another degree of freedom, but leaves open the question of whether HER2 protein is abundant. Mass spectrometry proteomics then confirms protein abundance, but functional activity requires something like a reverse phase protein array (RPPA): a high-throughput antibody-based technique that measures protein expression and post-translational modifications, measuring phosphorylated HER2 states. Each layer answers a yes/no question, progressively narrowing the space of possible cellular states until we understand why, within a HER2-amplified population, only a subset responds to targeted therapy.</p><blockquote><p><strong>A simpler way to think about this is as follows:</strong> before data collection, we don&#8217;t know if a tumor has HER2 gene amplification, whether HER2 gene expression is elevated, or HER2 proteins are abundant and if they are, which sites are phosphorylated, reflecting their activity. Each measurement we take answers one yes or no question. FISH shows HER2 amplification; that&#8217;s one point of uncertainty reduced. RNA-seq shows high HER2 expression&#8212;we have even less uncertainty about the tissues phenotype. But, it&#8217;s not until we move further down the hierarchy that uncertainty is reduced to such a degree that we can make a firm conclusion one way or another<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. </p></blockquote><p>The information flow also runs in reverse. Starting from phospho-HER2 elevation and working back through transcriptomics to genomics reveals mechanism&#8212;and mechanism determines treatment strategy. High HER2 protein resulting from gene amplification signals oncogene addiction, making pathway inhibition viable. High HER2 protein from impaired degradation, without amplification, suggests the protein may not be a driver, favoring HER2 as a target for antibody-drug conjugates rather than kinase inhibition. The phenotype is identical, but the causal path matters.</p><blockquote><p><strong>Note:</strong> Absence of gene amplification doesn&#8217;t definitively rule out oncogene addiction&#8212;elevated HER2 could still drive proliferation through other mechanisms (e.g., autocrine signaling, receptor heterodimerization). However, it shifts the prior probability: without amplification, HER2 is more likely a passenger or consequence of upstream dysregulation rather than the initiating driver. This probabilistic distinction matters for treatment selection&#8212;oncogene addiction predicts durable responses to pathway inhibition, while passenger alterations favor using the protein as a delivery target rather than inhibiting its function. For more on this topic check out <a href="https://sequenceanddestroy.substack.com/p/issue-58-how-to-kill-a-tumor">Issue #58 // How to Kill a Tumor: Three perspectives on rationale drug selection in personalized cancer therapy</a>. </p></blockquote><p>Understanding this through information theory makes explicit what we&#8217;re assuming in our experimental designs. RNA-seq gives us a compressed representation of cellular state&#8212;one that is remarkably productive for discovery, suggesting transcriptional regulation captures much of the cell&#8217;s control logic. But, the practical convenience of RNA-seq shouldn&#8217;t obscure the fundamental gap between what we measure and what actually drives biology. We should recognize when this lossy compression is acceptable and when we need measurements closer to the phenotype itself. In data-driven research, moving down the biological hierarchy reduces uncertainty about what the data mean. In clinical practice, moving up the hierarchy reduces uncertainty about how to intervene. These directions are complementary faces of the same information-theoretic process. </p><p>Data-driven research follows the path from genotype to phenotype, asking &#8220;what is the cellular state?&#8221; to interpret measurements correctly. Clinical practice reverses this, starting from phenotype and tracing back to mechanism, asking &#8220;why is this the state?&#8221; to select interventions. Both navigate the same multilevel system, but the direction of inquiry determines what uncertainty gets resolved&#8212;interpretive uncertainty versus mechanistic uncertainty. Precision medicine requires both: characterizing the actual state (down the hierarchy) and understanding its causal origin (up the hierarchy).</p><div><hr></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>As Julian Grandvallet-Contreras pointed out <a href="https://www.linkedin.com/feed/update/urn:li:activity:7421905631776903168/">here</a>, mRNA/protein correlations for a given gene-protein pair are tissue specific as well, which speaks to varying degree of mutual information between these measurements. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>After publishing this piece, I realized that part of my argument may be misleading if taken out of context. The idea that moving from DNA&#8594;RNA&#8594;protein&#8594;phosphoprotein measurements results in progressive uncertainty reduction is only true to the extend that we&#8217;re trying to understand the tissues phenotype. However, pretend we live in a universe where phosphoprotein sequencing is the default and DNA sequencing is a rarity. In this world, we may observe elevated HER2 family phosphoproteins in breast tumors, raising the question of whether this result from HER2 amplification. In order to reduce uncertainty in this case we&#8217;d move from  phosphoprotein/protein&#8594;RNA&#8594;DNA. As a result, we can&#8217;t say that any one -omics measurement is truer that another, as they answer different questions; thus, the mechanics of uncertainty reduction, through <a href="https://sequenceanddestroy.substack.com/p/issue-61-orthogonal-validation-over">orthogonal measurements</a>, is context specific. </p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #63 // When Absence Is Information—Missing Data in Proteomics]]></title><description><![CDATA[Distinguishing technical biases from biological signals and handling MCAR vs MNAR appropriately.]]></description><link>https://sequenceanddestroy.substack.com/p/issue-63-when-absence-is-informationmissing</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-63-when-absence-is-informationmissing</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 18 Jan 2026 18:13:10 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/13d72f8c-1b6a-44b9-81f2-75f506c4a1d1_936x964.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Enjoy this piece? </strong>Whow your support by tapping the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. Thanks!</p><div><hr></div><h2>Issue &#8470; 63 // When Absence Is Information&#8212;Missing Data in Proteomics</h2><p>I first started coding seriously during COVID-19 lockdowns, shortly after co-founding a wearable technology company, NNOXX. At the time my primary interest was analyzing time-series physiological data, which quickly evolved into building ML-based models to predict fatigue status, MSK injury risk, and other factors using distributed sensor networks. Early into my PhD though, my interests shifted from applied machine learning&#8212;building and deploying models for tasks like prediction, decision making, and pattern recognition&#8212;to data-driven research, specifically focusing on computational cancer biology. With this transition came unanticipated second-order effects. </p><p>When I was focusing on ML, I spent a lot of time thinking about how to prepare and preprocess data in such a way as to best exposure the underlying structure of a given prediction problem to an algorithm to achieve the best performance. Then, when my attention shift to research, I found myself thinking more about the nuances of how to best clean data to preserve biological signal without introducing noise or spurious association in down stream analysis. In theory, these two things aren&#8217;t so different&#8212;after all, data cleaning is data cleaning&#8212;but, as I&#8217;ve recently come full circle something clicked for me. Perhaps this is obvious to some of you reading, but there are different constraints and tradeoffs when cleaning or preparing a given dataset for applied ML and data-driven research. In this piece I&#8217;m going to discuss some of those differences as it applies to working with proteomic data, specifically in how we handle missing data. But, before getting into the weeds I want to back up and clarify what I mean when I say data cleaning. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>At the simplest level, data cleaning is the process of identifying and correcting systematic errors in your data. For example, data cleaning could include correcting mistyped data, removing corrupted, missing, or duplicate data points, and sometimes adding missing data values back in. Data cleaning may also include tasks such as encoding categorical variables into binary variables (i.e., turning no/yes into 0/1). For what may seem like a rote task, data cleaning actually requires a fairly high degree of domain expertise. For example, let&#8217;s say you&#8217;re working with a dataset where vO2 (volume of oxygen consumption) is one of your features and you notice a row where the value is 92 ml/kg/min. While not impossible, as a subject matter expert you may realize how improbably this value is, whereas someone without that expertise may overlook it. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!siTf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!siTf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 424w, https://substackcdn.com/image/fetch/$s_!siTf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 848w, https://substackcdn.com/image/fetch/$s_!siTf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 1272w, https://substackcdn.com/image/fetch/$s_!siTf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!siTf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png" width="1300" height="428" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:428,&quot;width&quot;:1300,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:94305,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/184954553?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!siTf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 424w, https://substackcdn.com/image/fetch/$s_!siTf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 848w, https://substackcdn.com/image/fetch/$s_!siTf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 1272w, https://substackcdn.com/image/fetch/$s_!siTf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F216f7cda-c83d-49aa-9a86-867815f6ab7b_1300x428.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Similarly, when looking at missing values in proteomic data, you may spot a pattern to the missingness, which can give clues as to whether these values should be dropped, whether you can impute values in, or whether you should leave them be. For example, you may notice that in a dataset of 500 breast cancer patients there are NaN values for the protein ERBB2 across ~15% of the samples. Upon closer inspection, you may see that those samples correspond to patients classified as HR+HER2- and HR-HER2-, suggesting that missingess is due to protein expression being below the detection threshold. Alternately, suppose you saw these same ERBB2 values missing, but with no clear separation by patient classification. In this case, maybe those NaN samples happened to correspond to patient samples (stored in rows) where many of the corresponding protein columns are blank. This might suggest that there was not enough tumor sample to process, and as a result this type of "missingeness" is technical, not biological. </p><p>Notice that identifying the cause of missing data isn&#8217;t just about looking at the data and finding patterns. You also have sufficiently understand the experimental context, the underlying biology, and potential sources of technical failure, whether during biological sample acquisition or sequencing. It&#8217;s for these reasons that I don&#8217;t advise fully automating data cleaning or outsourcing it to coding agents. While they may better understanding the statistical nature of your dataset, they often lack the context dependence to (a) understand the cause of missingness and (b) how to address it in a specific problem scenario. Importantly, these are two different skills; understanding the biological of your experimental condition and the sequencing technology generating your data allows you to solve the first problem. But, in order to figure out how to proceed after figuring out why your data is missing you need to understand the assumptions of the analyses you plan to perform. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Unlike missing data in many other contexts, missingness in proteomics is seldom random, and when it is it&#8217;s often hard to identify. Additionally, the appropriate strategy for handling different types of missingness differs between machine learning applications (where the goal is prediction) and data-driven research (where the goal is biological interpretation). This creates a tension between best practices for avoiding data leakage during ML applications (which typically means handling missing data after splitting the dataset) and the practical realities of working with proteomics data. In proteomics experiments, missing values typically result from one of two sources:</p><ul><li><p><strong>Missing completely at random (MCAR):</strong> Some samples may have a high proportion of missing protein measurements (e.g., &gt;50% of proteins unmeasured) due to insufficient tumor material, poor sample quality, or technical failures during sample processing. These samples represent failed experiments rather than a biological signal and should be removed from the dataset entirely before model development if the goal is prediction (this removal is a quality control step and does not constitute data leakage).</p></li><li><p><strong>Missing not at random (MNAR):</strong> Individual proteins may be missing in specific samples because their expression levels fall below the instrument&#8217;s detection limit. This is a common source of missingness in proteomics data and is informationally rich&#8212;the absence of a measurement often indicates very low or absent protein expression, which may be biologically and clinically relevant. For example, the absence of a measurable signal for an oncogene in a tumor sample might indicate good prognosis.</p></li></ul><p>For machine learning and predictive modeling applications a standard approach is to dealing with for MNAR-based missingness is to impute missing values with a small fraction of the protein-specific minimum detected value (typically 10% of the minimum for linear-scale data, or minimum minus 1-2 units for log-transformed data). This approach is biologically justified because it represents a plausible lower bound for expression (the protein is present but below detection) and preserves the biological information that "this protein is very low in this sample". Additionally, this approach is protein-specific, respecting that different proteins have different detection ranges and avoids the assumption that missing = zero, which would often be an incorrect interpretation. For log-transformed proteomics data (such as log2-scaled measurements), imputing with the protein-specific minimum minus 1.0 in log space corresponds to expression approximately half as abundant as the minimum detected level in linear space, which represents a biologically reasonable lower bound. </p><blockquote><p><strong>Did you find this piece helpful?</strong> Subscribe for free to have the next post delivered to your inbox:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p></blockquote><p>The challenge for machine learning workflows is that this imputation ideally should occur before train-test splitting, which appears to violate the principle of preventing information leakage from test to training data. However, this pre-split imputation is widely implemented in the proteomics biomarker literature for several reasons. For starters, the imputation is based on biological knowledge (below detection = very low) rather than statistical properties of the full dataset. Additionally, the calculation uses only protein-specific minima, not cross-sample statistics that would truly constitute leakage. Most importantly though, attempting to impute separately within train/validation/test splits can lead to different imputation schemes for the same protein across splits, which is biologically inconsistent.</p><p>Of course, there is an alternative to imputation. We could also just remove all proteins with any missing values. The problem with this approach is that in most cases it would eliminate nearly the entire dataset as some degree of below-detection missingness is nearly universal in proteomics. As a result, for machine learning applications, I often use the following three stage approach:</p><ul><li><p><strong>Stage 1:</strong> Remove samples with &gt;20% missing proteins (quality control for MCAR) before splitting. This stage removes samples with insufficient tumor material or severe technical failures, excluding poor-quality samples that don&#8217;t represent successful experiments.</p></li><li><p><strong>Stage 2:</strong> Remove proteins with &gt;20% missing values before splitting. After removing poor-quality samples, this stage identifies and removes proteins that are frequently unmeasured across the remaining high-quality samples. Proteins missing in more than 20% of samples likely reflect unreliable assays or technical measurement issues (MCAR) rather than biological signals. This ensures that the feature set consists only of proteins that can be reliably measured (after removing bad samples, high missingness in a protein indicates a fundamental measurement problem rather than sample-specific issues).</p></li><li><p><strong>Stage 3:</strong> Impute remaining sparse missing values (representing MNAR below-detection measurements) before splitting. At this point, remaining missing values are sparse (typically &lt;10% per protein) and likely represent true biological phenomena where specific samples have expression below the detection limit. For log-transformed data, impute with the protein-specific minimum log value minus 1.0 (corresponding to half the minimum expression in linear space). For linear-scale data, use 10% of the protein-specific minimum. While this technically uses information from the full dataset, it is justified by the biological nature of the imputation and the practical impossibility of the alternative (we can&#8217;t train models with missing values).</p></li></ul><p>For hypothesis-driven research and differential expression analysis (DEA), the approach to handling missingness is fundamentally different. DEA methods are specifically designed to handle missing values appropriately, and many can accommodate proteins with missingness in a subset of samples. Importantly, because the goal is to explain biology rather than build a predictive model, imputation is often undesirable&#8212;introducing imputed values can obscure true biological patterns and create artificial signals. Instead, a Perseus-style filtering approach is typically preferred: remove proteins with excessive missingness (e.g., missing in &gt;70% of samples within a condition or across all samples, for example) while retaining proteins with moderate missingness that still provide biological information. This preserves the interpretability of the results while ensuring adequate data quality.</p><p>For normal data-driven analyses with continuous expression data, I handle zeros (representing samples with expression too low to detect) differently depending on context. When I have untransformed data and I&#8217;m confident that zeros represent true below-detection expression rather than technical artifacts, I typically impute these with 10% of the protein-specific minimum value. However, if the source of missingness is ambiguous&#8212;I cannot distinguish whether it results from biological low expression or technical failure&#8212;I do not impute. Similarly, if the data has already been transformed before I receive it, I avoid imputation since I lack the information needed to apply biologically appropriate imputation strategies.</p><p>Regardless of how you handle this approach you have to be clear in documenting your rationale, explaining all three types of missingness (sample-level MCAR, protein-level MCAR, and MNAR) and justifying your choice of imputation versus filtering based on your analytical goals. For machine learning applications, You should also justify the pre-split imputation based on the MNAR nature of below-detection measurements and acknowledge that this represents a compromise between statistical ideals and biological realities . When possible, it&#8217;s also helpful perform sensitivity analyses showing that your results are robust to alternative imputation strategies. For other omics data types (transcriptomics, metabolomics), similar principles apply, though the specific imputation strategies may differ based on the measurement technology and the meaning of missingness in that context. The key is always to understand the biological and technical sources of missingness in your specific dataset and to choose imputation strategies that respect the underlying biology while minimizing the risk of introducing spurious patterns that could compromise model generalization.</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Issue #62 // Collider Bias In Genomics]]></title><description><![CDATA[Disentangling statistical artifacts from true biological effects]]></description><link>https://sequenceanddestroy.substack.com/p/issue-62-collider-biases-in-proteomics</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-62-collider-biases-in-proteomics</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 11 Jan 2026 18:38:28 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/2420b995-1b17-40a2-ba0f-2582939f7e8e_1432x726.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Did you find this piece helpful?</strong> If so, tap the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. You can also subscribe for free to have the next post delivered to your inbox:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2>Issue &#8470; 62 // Collider Biases In Proteomics Research </h2><p>This past week my friend Tommy Tang shared a post on collider biases in genomics research. Collider bias is a statistical distortion that occurs when a variable that is a common effect (a "collider") of two other variables is inadvertently controlled for, creating a  spurious association between those two variables. Tommy&#8217;s post got me thinking, as it seemed like a potential explanation for something I recently observed in my own research. </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!UTOc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UTOc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 424w, https://substackcdn.com/image/fetch/$s_!UTOc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 848w, https://substackcdn.com/image/fetch/$s_!UTOc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 1272w, https://substackcdn.com/image/fetch/$s_!UTOc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UTOc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png" width="482" height="409.58128078817737" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5a282cff-1355-462d-8923-938e36014ba4_812x690.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:690,&quot;width&quot;:812,&quot;resizeWidth&quot;:482,&quot;bytes&quot;:238619,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/184130848?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1733463e-35d8-4d61-8bfc-424dc3c72b92_812x690.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UTOc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 424w, https://substackcdn.com/image/fetch/$s_!UTOc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 848w, https://substackcdn.com/image/fetch/$s_!UTOc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 1272w, https://substackcdn.com/image/fetch/$s_!UTOc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5a282cff-1355-462d-8923-938e36014ba4_812x690.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://www.linkedin.com/posts/&#127919;-ming-tommy-tang-40650014_1-your-genomics-analysis-is-wrong-because-activity-7413946145443495936-zMa4/">Source</a> </figcaption></figure></div><p>First, to give some background, androgen receptor S650, estrogen receptor alpha, and cyclin D1 expression have previously been identified as biomarkers of global treatment resistance among high risk breast cancer patients as reported by Gallagher et al., in <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC10772394/">Protein signaling and drug target activation signatures to guide therapy prioritization: Therapeutic resistance and sensitivity in the I-SPY 2 Trial</a>. Specifically, when comparing responders versus non-responders in the ISPY-2 trial participant population&#8212;where responder is defined by achieving pathologic complete response (pCR) following treatment&#8212;these proteins were both up-regulated in non-responders and associated with an increased risk of recurrence (shorted distance recurrence free survival). </p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e9vJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e9vJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 424w, https://substackcdn.com/image/fetch/$s_!e9vJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 848w, https://substackcdn.com/image/fetch/$s_!e9vJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 1272w, https://substackcdn.com/image/fetch/$s_!e9vJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e9vJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png" width="1278" height="470" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:470,&quot;width&quot;:1278,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:509437,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/184130848?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff6c187ce-b649-4bc9-a27d-ec51b56cafe6_1278x470.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!e9vJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 424w, https://substackcdn.com/image/fetch/$s_!e9vJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 848w, https://substackcdn.com/image/fetch/$s_!e9vJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 1272w, https://substackcdn.com/image/fetch/$s_!e9vJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb21e3d26-ac9d-4aba-8bf8-d4b63da3ca32_1278x470.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC10772394/">Source</a></figcaption></figure></div><p>An important note is that despite the ISPY-2 trial being limited to high risk breast cancer, the patient population in the study is fairly heterogeneous, including a mix of patients with HR+/HER2-, HR-/HER2-, HR+/HER2+, and HR-/HER2+ tumors<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Among these HR/HER2 status-based cohorts in the ISPY-2 trial, the HR+/HER2- group had the lowest pCR rate<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>, and as a result a significant fraction of non-responders in the overall population (approximately ~45%) came from this group.  So, it was to my surprise when I was working with reverse phase protein array data from this study, available on the Gene Expression Omnibus via accession number GSE196093, that I found that androgen receptor S650, androgen receptor total, and estrogen receptor alpha S118 were positive prognostic markers in the treatment-refractory HR+/HER2- patient population, with increased expression of all three proteins being associated with a lower risk of recurrence, as demonstrated by the survival curves below.</p><blockquote><p>To make it more explicit, here we have a case where AR and ER are up-regulated in non-responders versus responders and are demonstrated to be negative prognostic markers across then entire patient population (i.e., associated with worse survival outcomes). However, within the non-responder patient population, these same proteins are positive prognostic markers and are associated with more favorable survival outcomes. </p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WdB-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WdB-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 424w, https://substackcdn.com/image/fetch/$s_!WdB-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 848w, https://substackcdn.com/image/fetch/$s_!WdB-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 1272w, https://substackcdn.com/image/fetch/$s_!WdB-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WdB-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WdB-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 424w, https://substackcdn.com/image/fetch/$s_!WdB-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 848w, https://substackcdn.com/image/fetch/$s_!WdB-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 1272w, https://substackcdn.com/image/fetch/$s_!WdB-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff05d7a29-1475-4b9f-88d6-f23c1463c254_1600x345.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Now, because cancer is a highly heterogeneous disease, we do also need to consider the fact that the prognostic value of androgen and estrogen receptor expression depends on disease context and molecular subtype. Previous survival analyses have shown that AR acts as a tumor suppressor in ER-positive breast cancers, making it a favorable prognostic marker, and a tumor promoter in ER-negative breast cancers, including both ER-/HER2+ and triple negative breast cancer, making it a poor prognostic factor in these contexts (<a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC8774219/">You et al., 2022)</a>. This biological heterogeneity could explain part of what we&#8217;re observing: the HR+/HER2- non-responders who express higher AR/ER may represent a biologically distinct subgroup where these receptors retain their tumor-suppressive functions despite treatment resistance in this population. This raises an important question: is the observed association in HR+/HER2- non-responders spurious (collider bias) or real, but context-dependent (effect modification)? Can it be both?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!mIUy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!mIUy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 424w, https://substackcdn.com/image/fetch/$s_!mIUy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 848w, https://substackcdn.com/image/fetch/$s_!mIUy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 1272w, https://substackcdn.com/image/fetch/$s_!mIUy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!mIUy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png" width="596" height="282.4971098265896" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:656,&quot;width&quot;:1384,&quot;resizeWidth&quot;:596,&quot;bytes&quot;:754197,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/184130848?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2390b3c9-1bd9-4fcd-8d81-ba47cfaee554_1384x660.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!mIUy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 424w, https://substackcdn.com/image/fetch/$s_!mIUy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 848w, https://substackcdn.com/image/fetch/$s_!mIUy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 1272w, https://substackcdn.com/image/fetch/$s_!mIUy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faeb9fadd-6b32-4498-84b6-c3125344b9f6_1384x656.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The mechanisms of action of AR in breast cancers depend on the disease sub-type (<a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC8774219/">source</a>)</figcaption></figure></div><p>The answer to the above question is yes. Collider bias and real effect modification are not mutually exclusive. Collider bias is a statistical structure issue&#8212;conditioning on treatment response can induce spurious associations or amplify existing associations. Effect modification, in contrast, reveals a true biological signal. In this case, the causal effect of androgen and estrogen receptor expression genuinely differs by context (breast cancer subtype, tumor stage, treatment arm, etc.), suggesting effect-modification is in play. However, this doesn&#8217;t mean that a collider bias isn&#8217;t also present. We know that AR/ER have different prognostic effects in hormone receptor positive (HR+) versus hormone receptor negative (HR-) breast cancer, but by conditioning on treatment response (pCR status in this case), we&#8217;re selecting a non-random subset of patients. This selection process can still induce or amplify associations beyond what the true biological effect would predict. As a result, even where there&#8217;s a real context-dependent biological effect, collider bias can distort the magnitude of effect we observe in the conditioned subset. Thus, the association we observe might be partly real (effect modification) and partly artifactual (collider bias).</p><p>This raises the question, how do we determine the degree to which what we&#8217;ve observing is collider bias versus real effect? One way to do this is to analyze the protein-outcome association separately within each HR/HER2 subtype in the overall population, then compare these associations with what you observe when you condition on non-responders within each HR/HER2 subtype individually. In my case, I first looked at whether AR/ER are positive or negative prognostic markers within each HR/HER2 subtype population overall. This yielded the following results:</p><ul><li><p>Overall HR+/HER2- population &#8594; AR/ER = positive prognostic </p></li><li><p>Overall HR-/HER2- population &#8594;  AR/ER = negative prognostic </p></li><li><p>Overall HR-/HER2+ population &#8594; AR/ER = negative prognostic </p></li><li><p>Overall HR+/HER2+ population &#8594; AR/ER = no significant associations</p></li></ul><p>Next, I did a non-responder only analysis within each of these subtypes:</p><ul><li><p>HR+/HER2- Non-responders only &#8594; AR/ER = positive prognostic </p></li><li><p>HR-/HER2- Non-responders only &#8594; AR/ER expression is still unfavorable</p></li><li><p>HR-/HER2+ Non-responders only &#8594; AR/ER expression is still unfavorable</p></li><li><p>HR+/HER2+ Non-responders only &#8594; still no significant associations</p></li></ul><p>Notice, the directional associations did not change after conditioning, which suggests that the biological effect we&#8217;re observing is real. In other words, androgen and estrogen receptor expression are actually associated with better survival outcomes in the treatment-refractory HR+/HER2- patient population. </p><p>To further parse this effect we can also look to see if there is a negative correlation between androgen and estrogen receptor expression and other predictors of non-response, even if these predictors are positively correlated with androgen and estrogen receptor expression in the overall study population. The logic here is that when you condition on a collider (treatment non-response), you create ratification negative correlation between the causes of that collider, even if they are positive or independent in the overall population. Specifically, when conditioning on non-response, we should expect to see that non-responders (patients with high AR/ER expression in this case) will tend to have lower levels of other non-response predictors such as Ki67 levels<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. When I tested this with the aforementioned dataset, this does not appear to be the case. Instead, there is no significant association between Ki67 and treatment response in the HR+/HER2- population. </p><p>So, back to Tommy&#8217;s original post, understanding collider biases and how to disentangle statistical artifacts from true biological signals in an important skill for computational biologists. This is even more so in computational cancer biology, where context-dependence is the norm.  We&#8217;re constantly faced with scenarios where a given protein acts as a tumor promoter in one context but a suppressor in another, or where a protein signature predicts treatment response in one molecular subtype but non-response in another. As a result, we need to pay careful attention to how studies are designed, what variables are conditioned on (or not), and how our analytical choices may distort true biological signals or introduce spurious associations. The I-SPY 2 dataset is a great testbed for these types of analyses because of its rich omics and clinical data, but we&#8217;re rarely so fortunate in practice. In most cases, we  lack comprehensive clinical outcome data, detailed patient stratification, or sufficient demographic information. As a result, we need to tread carefully when interpreting our data, assess whether our findings align with the broader literature, and temper our confidence when we lack the data needed to properly disentangle collider bias from effect modification.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>HR/HER2 status-based subtypes are themselves heterogenous groups, prompting the creation of response predictive subtypes (RPS) that better predict treatment outcomes than HR/HER2 status alone, as demonstrated in <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC9426306/">Redefining breast cancer subtypes to guide treatment prioritization and maximize response: Predictive biomarkers across 10 cancer therapies</a>. In this paper Wolf and colleagues identified five RPS subtypes using immune activation signatures, DNA repair deficiency scores, and HER2/luminal phenotypes, resulting in three HER2 negative classifications (HER2-/Immune-/DRD-, HER2-/Immune+, and HER2-/Immune-/DRD+) and two HER2 positive classifications (HER2+/BP-Luminal and HER2+/BP-HER2_or_Basal). </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Despite the HR+/HER2- group having the lowest pCR rate, and being associated with a higher residual cancer burden after treatment, the HR-/HER2- (TNBC) group had the shortest distance recurrence free survival. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Ki67 is the percentage of cells that are actively dividing and is therefore used as a proliferation index. A high Ki67 level generally corresponds to a worse prognosis since higher Ki67 indicates a more aggressive cancer. However, because chemotherapies target rapidly dividing cells, high Ki67 often predicts better response to chemotherapy as it reflects a tumor&#8217;s innate vulnerability to treatment. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #61 // Orthogonal Validation Over Replication ]]></title><description><![CDATA[The biology we study is constrained by the biology we can measure. The solution to this problem isn&#8217;t more reproducibility&#8212;It&#8217;s orthogonal validation using different measurement approaches.]]></description><link>https://sequenceanddestroy.substack.com/p/issue-61-orthogonal-validation-over</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-61-orthogonal-validation-over</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 04 Jan 2026 13:57:46 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/32d2c646-c73d-4ea0-9928-1de21b627c66_1446x850.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Thanks for reading!</strong> If you found this post useful, please consider subscribing. I share hands-on computational biology techniques, fresh ways to think about tough problems, and perspectives on a range of related topics. All free, straight to your inbox:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2>Issue &#8470; 61 // Orthogonal Validation Over Replication </h2><p>I recently shared the following note, which was a reflection on the fact that molecular biology research typically relies on indirect measurement techniques to infer the presence, abundance, and activity of genes, proteins, and metabolites (as well as the downstream consequences).<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><div class="comment" data-attrs="{&quot;url&quot;:&quot;https://open.substack.com/home&quot;,&quot;commentId&quot;:185492554,&quot;comment&quot;:{&quot;id&quot;:185492554,&quot;date&quot;:&quot;2025-12-08T13:05:53.081Z&quot;,&quot;edited_at&quot;:null,&quot;body&quot;:&quot;Every biological dataset is separated from the underlying biology by layers of abstraction generated from measurement technologies, each introducing its own distortions. RNA-sequencing doesn&#8217;t measure transcript abundance directly&#8212;it infers relative abundance which imperfectly correlates with actual mRNA levels due to varying reverse transcription efficiency, PCR amplification bias, sequencing error rates, and computational alignment artifacts. Mass spectrometry proteomics is even more indirect, with detection probability varying by orders of magnitude across proteins based on ionization efficiency and peptide properties unrelated to biological abundance. Fluorescence microscopy measures photon emission from fluorophores attached to antibodies binding to epitopes, any step of which can fail or vary in ways that have nothing to do with the protein we intend to visualize. \n\nThis measurement layer is never perfectly transparent&#8212;we&#8217;re always seeing biology through a distorting lens&#8212;yet our analyses often treat measurements as if they directly report biological quantities. We normalize, correct, and adjust, but these  fixes assume we understand the measurement process well enough to invert it. The unsettling reality is that a portion of what we attribute to biological variation might just reflect variation in how different samples interact with our measurement technologies.&quot;,&quot;body_json&quot;:{&quot;type&quot;:&quot;doc&quot;,&quot;attrs&quot;:{&quot;schemaVersion&quot;:&quot;v1&quot;},&quot;content&quot;:[{&quot;type&quot;:&quot;paragraph&quot;,&quot;content&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;text&quot;:&quot;Every biological dataset is separated from the underlying biology by layers of abstraction generated from measurement technologies, each introducing its own distortions. RNA-sequencing doesn&#8217;t measure transcript abundance directly&#8212;it infers relative abundance which imperfectly correlates with actual mRNA levels due to varying reverse transcription efficiency, PCR amplification bias, sequencing error rates, and computational alignment artifacts. Mass spectrometry proteomics is even more indirect, with detection probability varying by orders of magnitude across proteins based on ionization efficiency and peptide properties unrelated to biological abundance. Fluorescence microscopy measures photon emission from fluorophores attached to antibodies binding to epitopes, any step of which can fail or vary in ways that have nothing to do with the protein we intend to visualize. &quot;}]},{&quot;type&quot;:&quot;paragraph&quot;,&quot;content&quot;:[{&quot;type&quot;:&quot;text&quot;,&quot;text&quot;:&quot;This measurement layer is never perfectly transparent&#8212;we&#8217;re always seeing biology through a distorting lens&#8212;yet our analyses often treat measurements as if they directly report biological quantities. We normalize, correct, and adjust, but these  fixes assume we understand the measurement process well enough to invert it. The unsettling reality is that a portion of what we attribute to biological variation might just reflect variation in how different samples interact with our measurement technologies.&quot;}]}]},&quot;restacks&quot;:3,&quot;reaction_count&quot;:23,&quot;attachments&quot;:[],&quot;name&quot;:&quot;Evan Peikon&quot;,&quot;user_id&quot;:38996408,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fff6889dc-95f3-428f-a2cf-d890b7cba3d1_454x454.png&quot;,&quot;user_bestseller_tier&quot;:null,&quot;userStatus&quot;:{&quot;bestsellerTier&quot;:null,&quot;subscriberTier&quot;:null,&quot;leaderboard&quot;:null,&quot;vip&quot;:false,&quot;badge&quot;:null,&quot;paidPublicationIds&quot;:[],&quot;subscriber&quot;:null}},&quot;source&quot;:null,&quot;forumChannel&quot;:null}" data-component-name="CommentPlaceholder"></div><p>Anyone who works with biological data knows this intuitively, yet it&#8217;s easy to forget that our measurement technologies themselves are not perfectly transparent and can distort our observations in subtle ways. In computational biology we address this issue through quality control, normalization, and batch correct. But, the unsettling reality is that a portion of what we consider to be biological signal might reflect variations in how different samples interact with our measurement technologies<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><p>For example, RNA-sequencing is prone to GC content bias and amplification preferences, stemming from the physicochemical properties of DNA/RNA, which affects which transcripts we detect and how we quantify them. Additionally, mass spectrometry-based proteomics favors certain peptides based on ionization efficiency and often misses low-abundance proteins entirely, while antibody-based methods depend on epitope accessibility that may not reflect true protein abundance or activity. These types of errors aren&#8217;t random and as a result they don&#8217;t just average out given enough data. They&#8217;re systematic biases inherent to how each measurement technology interrogates biological systems.</p><p>This brings us to something I&#8217;ve been thinking a lot about recently, which is that when different sequencing runs, technicians, or reagent lots produce different results, we often attribute it to batch effects that need correction. But, batch correction, by definition, can only address variations between uses of the same measurement technology. It doesn&#8217;t, and cannot, address the inherent way that technology affects what we ultimately measure. We&#8217;re always measuring biology filtered through the lens of our chosen instrumentation, limitations and biases included, and no amount of normalization or batch correction can remove these types of systematic distortions. If this is true&#8212;and I&#8217;m open to the possibility that my logic is flawed&#8212;then it leads to the realization that we often can&#8217;t distinguish between measurement technology-intrinsic bias and true biological signals when working within a single measurement modality. Using the RNA-sequencing example again, consider a given reproducible finding&#8230; can we be certain we&#8217;re identifying true transcriptional regulation, or is it possible we&#8217;re capturing a systematic bias in how RNA-seq captures certain transcript features?</p><p>When we apply batch correction, we make certain assumptions about what constitutes technical versus biological variation. However, these assumptions can&#8217;t be validated within a given measurement technology. Additionally, if biological and technical variation correlate (for example, diseases tissue being processed differently than controls, samples being collected at different time points, population structure confounding with processing batch, etc), our corrections may inadvertently remove real biological signals or introduce spurious patterns.  </p><p>This is where an indiscriminate fixation on scientific reproducibility, which while foundational to science, can introduce a blind spot. We treat replication across labs, datasets, and study cohorts as the gold standard for biological truth, but reproducible doesn&#8217;t always mean correct&#8212;it could mean that our systematic errors are consistent. For example, if a given set of measurement protocols introduce similar biases and downstream analysis pipelines make the same normalizations assumptions we&#8217;ll get perfectly reproducible results that all reflect the same technical artifacts. As a result, some of our most reproducible findings&#8212;the ones that replicate across dozens of studies&#8212;might be measuring properties of our measurement technologies rather than properties of biology.</p><p>Of course, not all reproducibility is created equal. Replication using identical protocols in different labs is weaker evidence than replication using varied protocols within the same measurement technology. But both are fundamentally limited by the fact that they're looking at biology through the same technological lens. The solution to this problem isn&#8217;t more reproducibility through the same analytics lenses&#8212;It&#8217;s orthogonal validation using fundamentally different measurement approaches. When RNA-sequencing, high-throughout proteomics, and functional assays all independently point to the same conclusion despite having different technical biases you&#8217;re likely to have triangulated in on a true biological signal<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.   </p><p>This is also why the reflexive integration of multi-omics data can be counterproductive. Sometimes we don&#8217;t want to merge transcriptomics and proteomics into a unified model&#8212;we want each to speak independently, then ask whether they&#8217;re telling the same story. If they converge despite looking at biology through entirely different technical lenses, the signal is real. If they diverge, we&#8217;ve learned something about either the biology or our measurement limitations. The goal isn&#8217;t consensus through integration; it&#8217;s triangulation through independence<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>.</p><div><hr></div><p><strong>Liked this piece?</strong> If so, tap the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. </p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I&#8217;ve previously written about this topic, as is relates to measuring muscle oxygenation, in <a href="https://sequenceanddestroy.substack.com/p/understanding-variation-in-muscle?utm_source=publication-search">Dampening the Noise: Making Sense of Variability In Biometric Measurements</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Andrew Carroll recently commented on this in T<a href="https://andrewcarroll.github.io/2025/12/23/the-virtual-cell-will-be-more-like-GWAS-than-AlphaFold.html">he Virtual Cell Will Be More Like Gwas Than Alphafold</a>, where he states &#8220;&#8230;But there are two areas where we still have obstacles to overcome. The first is that experimental batch effects are very strong in these assays. Which sequencer you use, which kits you use, how you prepare the RNA, how you handle the cells before preparation can all have large effects. So the majority of the difference between two datasets can be effectively &#8220;non-biological&#8221; in the sense that what you would learn doesn&#8217;t correspond to the parts of the biology you want to learn.&#8221;</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>This doesn&#8217;t necessarily mean taking all of these measurements at the same point in time, as different omics technologies capture biological processes at different time scales, which needs to be accounted for when interpreting data. For more on this topic see <a href="https://sequenceanddestroy.substack.com/p/multiomics-data-in-4d?utm_source=publication-search">Issue #45 // Time: The Fourth Dimension In Multiomics Data Analysis</a>. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>In practice, this could be as simple as performing two separate sets of differential expression analyses using transcriptomics and proteomics data from the same experiment. From these outputs, you can see if paired genes and proteins are up- and down-regulated to a similar degree (acknowledging that RNA and protein abundances don&#8217;t correlate perfectly even in ideal conditions, so we&#8217;re looking for directional consistency rather than quantitative matching). Or, where measured genes and proteins do not align well, you can see if both datasets result in similar pathway enrichment. Additionally, if you were to perform weighted co-expression network analysis with both datasets, comparing control vs treatment networks, you can see if the same patterns emerge (for example, network density increasing across conditions), if similar functional modules form, if corresponding networks across datatypes have similar graphlet structures, etc. The specifics here aren&#8217;t necessarily prescriptive&#8212;the key is asking whether independent measurement modalities converge on the same biological interpretation despite their different technical biases. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #60 // Compression As Understanding ]]></title><description><![CDATA[The Tension Between Mechanistic Understanding/Clinical Interpretability and Predictive Biology]]></description><link>https://sequenceanddestroy.substack.com/p/issue-60-compression-as-understanding</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-60-compression-as-understanding</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 28 Dec 2025 18:16:03 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/045055cd-143e-4ebd-a89e-ac5de0c2e09c_1384x808.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Liked this piece?</strong> If so, tap the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. You can also subscribe for free to have the next post delivered to your inbox:</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://sequenceanddestroy.substack.com/subscribe?"><span>Subscribe now</span></a></p><div><hr></div><h2>Issue &#8470; 60 // Compression As Understanding  </h2><p>The most useful computational models in biology aren&#8217;t the ones with the most parameters&#8212;they&#8217;re the ones that achieve the most compression. When you can predict gene expression across dozens of conditions using only a handful of latent factors or explain variations in protein abundance with a small set of regulatory relationships you&#8217;ve found something closer to biological "ground truth" than any super high-dimensional model that just memorizes your training data and spits out a prediction, even if it&#8217;s more accurate (as defined in a specific testing context).  </p><p>Beyond their use as data visualization tools, dimensionality reduction techniques&#8212;like PCA, t-SNE, and UMAP&#8212;can be used to parse the underlying structure of biological systems, which I wrote about previously in <a href="https://sequenceanddestroy.substack.com/p/machine-learning-the-native-language">Issue #36 // Machine Learning: The Native Language of Biology</a>, stating:</p><blockquote><p>"In a neural network, high-dimensional input data is compressed into latent representations&#8212; abstract patterns that capture essential features while discarding noise. Similarly, a transcription factor distills complex environmental information into a binary state (active or inactive) that the cell can use to make decisions. This concept of latent spaces offers another window into why machine learning aligns so well with biology. In cell biology, high-dimensional data like gene expression profiles or microscopy images can be projected into lower-dimensional spaces that capture meaningful biological variation. Each dimension in this latent space ideally corresponds to some biological process or state&#8212;cell cycle phase, differentiation stage, or stress response."</p></blockquote><p>If 20,000 genes can be projected into 50 principal components that capture most of the variance, this tells us the genome isn&#8217;t actually operating in 20,000-dimensional space. It&#8217;s operating in something far lower-dimensional, with the vast majority of genes acting as downstream readouts of a smaller set of regulatory "programs".</p><p>The same idea can be applied to patient stratification. Take breast cancer, for example. We know every patient&#8217;s cancer is unique with its own gene mutations and amplifications, alterations to transcriptional programs, behaviors, and responses to treatment. This idea is encapsulated by the Anna Karenina principle in Oncology, which states something to the effect that all healthy tissues are alike, whereas all diseased tissues are diseased in their own way. Yet, if we can compress thousands of clinical and molecular features into a small number of subtypes that meaningfully predict treatment response (such as ER/HER status, PAM50 subtypes, MammaPrint risks profiles, etc), there is an argument to be made that despite enormous molecular complexity, the relevant biological variation between tumors has low intrinsic dimensionality<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. </p><p>However, there is a strong argument to be made that compression is most useful when it leads to increased understanding of the underlying biological systems you&#8217;re working on. An auto-encoder can compress gene expression data perfectly, while learning latent representations that cannot be biologically, or mechanistically, interpreted. The representations that matter are the ones that align with biological processes we can observe, measure, or manipulate because those are the dimensions along which we can actually intervene. </p><p>This is why interpretable models often outperform black boxes in biological and clinical applications/use cases, even when they&#8217;re less accurate. A decision tree that says "if local tissue hypoxia and immune infiltration, then poor prognosis" is testable biology. A neural network that achieves 2% better accuracy but can&#8217;t explain its reasoning in biological terms may just be performing complex pattern matching. In this way, the goal of computational modeling isn&#8217;t prediction for its own sake. It&#8217;s to build models that can compress biological complexity and heterogeneity in ways that suggest new experiments, reveal potential mechanisms or action, and ultimately show us levers that we can pull to intervene in disease. </p><p>George Church recently gave a great interview on this topic for the Lifespan Research Institute which you can find <a href="https://lifespan.io/news/george-church-on-building-scientific-superintelligence/">here</a>. I&#8217;ve reposted a short except below:</p><blockquote><p><strong>About that: it&#8217;s always interpretability versus the model&#8217;s power. Where are you in this debate? Would you prefer a weaker but more interpretable AI or a stronger but less interpretable one?</strong></p><p>GC: I lean on the interpretability side. It&#8217;s not an either-or, but&#8230; we&#8217;re in science. Few engineers are willing to just pull a rabbit out of a hat, just a black box. Scientists and engineers, by and large, want to know the mechanism. The FDA likes to know mechanisms. Typically, the autocatalytic loop where you learn something and then you invent something is better if it&#8217;s mechanistically grounded. So, I lean pretty heavily in the direction of interpretability, explainability, transparency, et cetera, and also it&#8217;s safer.</p><p><strong>I just honestly think that we will soon be faced with this dilemma, where we will have to choose between the power of the model to do things and its actual interpretability, but maybe we&#8217;re not there yet.</strong></p><p>GC: If you look at the human scientist experience, the most powerful sciences are the ones that are better articulated mechanistically on a solid foundation rather than black boxes. The black boxes tend to include artifacts, dead ends. Most of the progress in science and engineering has been part of community efforts with strong mechanistic underpinnings.</p></blockquote><p>These ideas stand opposed to the core thesis of <a href="https://blog.jck.bio/p/predictive-biology">predictive biology</a>, which argues that predicting the outcome of an unknown experiment is equivalent to understanding a system. While beyond the scope of this article, the tension between "mechanism as understanding" and "prediction as understanding" is interesting to reflect on. As a biologist, I tend to lean towards the former. </p><p>This tension has played out in my own research a bit, where I&#8217;ve worked with both Cypher query-based graph traversal methods and graph neutral network-based approaches to predict phenotypes from molecular perturbation data. The former is more interpretable, allows for novel pathway discovery, and provides more causal information, which both makes it more trustworthy from a clinical standpoint. Yet, the GNN based approaches often make predictions that the traversal based ones miss, which while hard to validate the utility of are interesting nonetheless. Of course, there are ways to take the best of both of these worlds and create a hybrid system, but at the end of the day we still need to decide how much we&#8217;re willing to trust predictions we don&#8217;t fully understand, and the degree to which we need our models to be mechanistic and interpretable for them to be useful. </p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>The big caveat here is that the compressed subtypes have to predict treatment response better than higher dimensional alternatives in order to be considered meaningful. As I&#8217;ve written about <a href="https://substack.com/@evanpeikon/note/c-184470773">previously</a>, clustering techniques will always "work" in that they will produce partitions, or groupings, in data whether or not those partitions reflect biological reality. It&#8217;s up to us to determine if the boundaries between clusters reflect genuine biological discontinuities or arbitrary algorithmic choices, which is why models with mechanistic explainability are so valuable.  </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #59 // Castles on Quicksand]]></title><description><![CDATA[A Consumer Health Tracking Story]]></description><link>https://sequenceanddestroy.substack.com/p/issue-59-castles-on-quicksand</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-59-castles-on-quicksand</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Mon, 15 Dec 2025 12:58:38 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/a75d2b5e-b9ea-41fc-b555-8b14c50aa425_1438x846.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Enjoy this issue?</strong> If so, you can show your support by tapping the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. Thanks!</p><div><hr></div><h2>Issue &#8470; 59 // Castles on Quicksand: A Consumer Health Tracking Story</h2><p>Last month, a friend showed me the Whoop app&#8217;s latest feature with a level of enthusiasm typically reserved for winning the lottery. "Look!," he said, pointing to a number on his iphone screen, "My biological age is three years younger than my chronological age!" </p><p>Hearing my friend wax poetic about his Whoop Age left me in an uncomfortable position. As someone who founded a wearable biosensor company and understands the limits of current sensing modalities&#8212;and actively works on developing new protein biomarkers&#8212;I had a strong inkling that this measurement was less than accurate, to put it kindly. After all, his Whoop wasn&#8217;t measuring his telomere length, DNA methylation patterns, or protein aggregation markers. It was measuring his heart rate, HRV, blood oxygenation, skin temperature, and movement, then applying algorithms trained on population-level data to generate what appeared to be a legitimate biological assessment. This disconnect between the information Whoop claimed to provide and what their devices actually measures was surprisingly to say the least<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.  </p><p>Despite my strong opinions about the validity of some of Whoop&#8217;s digital biomarkers, I don&#8217;t believe the company is malicious, nor do I think they intend to deceive consumers. What I do believe is that they, and other consumer health companies, are operating in a hyper-competitive environment where the reward for sensational and easily marketable features is increased consumer adoption and the penalty for conservative claims is irrelevance. In this environment the question isn&#8217;t whether any individual company will make unsubstantiated claims or market a feature their technology can&#8217;t support&#8212;instead, it&#8217;s who will be the first to do it, and how far they can stretch these limits until consumers loose faith in their technology. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>To fully appreciate why the current generation of consumer health devices represents such a profound case of measurement mismatch, we need to start with what we know about human biology and health from a bottom-up perspective. Take anti-aging&#8212;the poster child for speculative consumer health claims. The past two decades of research, propelled by next generation sequencing, have shown aging to be a complex multi-system phenomenon involving cellular senescence, mitochondrial dysfunction, protein aggregation, genomic instability, and epigenetic drift. All of which are processes several layers of biological abstraction removed from anything a wrist-worn optical sensor can detect<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. </p><p>The same principle applies to brain health and immune function among other areas of health. The most meaningful biomarkers exist at the levels of gene expression, protein abundance, metabolite concentrations, and methylation patterns. Measuring these things accurately requires high-throughput RNA sequencing, mass spectrometry, immunoassays, imaging techniques, and other technologies. A device that measures heart rate, skin conductance, and movement patterns is an entirely different universe of biological information.</p><p>This isn't to say that the physiological parameters measured by consumer devices are meaningless or lack utility. Heart rate variability does correlate with autonomic nervous system function, sleep architecture does relate to metabolic health, and activity patterns do influence cardiovascular risk. But correlation is not causation, and correlation is certainly not equivalent to direct measurement. When devices measure heart rate variability and claim to be assessing "recovery" or measure blood glucose and claim to measure "metabolic health" their providers are making conceptual leaps over gaps that cannot be spanned by objective fact. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Whoop&#8217;s "Pace of Aging" score provides a perfect case study into how this measurement gap manifests itself in practice. Whoop, as company, has positioned its product as a leading form-first wearable<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> and its devices genuinely excel at what their hardware is designed to measure &#8212; namely <a href="https://www.mdpi.com/1424-8220/22/16/6317">heart rate and heart rate variability</a>. </p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CXAB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CXAB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 424w, https://substackcdn.com/image/fetch/$s_!CXAB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 848w, https://substackcdn.com/image/fetch/$s_!CXAB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 1272w, https://substackcdn.com/image/fetch/$s_!CXAB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CXAB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png" width="476" height="182.40276179516687" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:333,&quot;width&quot;:869,&quot;resizeWidth&quot;:476,&quot;bytes&quot;:179553,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/171825956?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F678cc708-b0a6-4f94-a3bb-c31bf2c84381_900x364.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CXAB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 424w, https://substackcdn.com/image/fetch/$s_!CXAB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 848w, https://substackcdn.com/image/fetch/$s_!CXAB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 1272w, https://substackcdn.com/image/fetch/$s_!CXAB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcfcedeba-062d-4043-b489-c5b3ace4fce9_869x333.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption">Bland-Altman plots for laboratory-based vs. wearable deviced-derived HRV and heart rate measurements (<a href="https://www.mdpi.com/1424-8220/22/16/6317">source</a>).</figcaption></figure></div><p>The problems begin when accuracy with their primary sensing modality&#8212;photoplethysmography&#8212;becomes the basis for claims that venture far beyond what the underlying hardware and sensors can support. The biological age and pace of aging measurements are the most egregious examples, but they are hardly unique. </p><p>Consider what would actually be required to estimate biological age accurately. Morgan Levine and colleague&#8217;s <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC5940111/#sec16">DNAm PhenoAge</a>, one of the more sophisticated approaches to biological age prediction, incorporates chronological age, albumin, creatinine, glucose, C-reactive protein, lymphocyte percentage, mean corpuscular volume, red cell distribution width, alkaline phosphatase, and white blood cell count in it&#8217;s model. Yet, even this approach has its limitations in that biological aging is organ-specific. Your liver, brain, and cardiovascular system don't age in lockstep, and as a result any single metric inevitably obscures important biological heterogeneity. Capturing this heterogeneity is possible, but it requires high-throughout proteomics, measuring thousands of plasma proteins simultaneously, as demonstrated in <a href="https://www.nature.com/articles/s41591-019-0673-2">this seminal paper</a> from of the Wyss-Coray lab at Stanford. </p><p>Whoop&#8217;s approach, by contrast, relies on algorithms trained on population-level data to identify correlations between lifestyle factors and age-related outcomes. In their own words, "Whoop Age and Pace of Aging are designed to reflect well-established links between behavior, physiology, and long-term health. While WHOOP Age correlates with perceived health and general wellness at a population level, there is no clinical benchmark for validating these metrics&#8203;."<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> This approach might have some predictive value at the population level, but presenting it as a precise measurement of biological age feels a bit suspect. </p><p>The tragedy is that Whoop&#8217;s underlying, unadulterated, measurements are valuable. They don&#8217;t even require inflated claims to justify their use. So, why then do they take this approach? The answer lies in what I call the single-point sensing trap. When you&#8217;ve built your entire brand around the premise that a wrist-worn band can provide comprehensive health insights&#8212; as Whoop claims with their promise to "improve performance, build healthier habits, and extend health span with continuous health monitoring"&#8212; you&#8217;ve painted yourself into a technological corner. To admit that holistic health monitoring might require multiple distributed sensing modalities would undermine what makes your product marketable in the first place. </p><p>This constraint creates a perverse innovation cycle. Instead of expand their sensing capabilities to match their ambitious claims, companies like Whoop are forced to stretch their existing sensor data even thinner, creating increasing elaborate algorithmic castles built on quicksand. Each new feature, from sleep coaching to strain optimization to biological age estimation, represents another layer of abstraction designed to extract insights that their hardware just can&#8217;t capture. </p><p>Meanwhile, a schism is starting to open in the consumer health space between companies doubling down on single-point sensing, like Whoop, and those that are beginning to embrace a distributed sensor architecture, which I&#8217;ve previously written about in <a href="https://sequenceanddestroy.substack.com/p/distributed-by-design">Issue #48: Distributed by Design</a>. Oura&#8217;s recent integration with Stelo speaks to this. When Oura acknowledges that ring-based measurements need to be complemented by continuous glucose data to provide meaningful metabolic insights, they're admitting something that should be obvious but remains unspoken: human biology is too complex for any single sensing modality to capture.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The transition from single-point to multi-point sensing enables something more profound that just measurement diversity; it opens the door to discovering <a href="https://sequenceanddestroy.substack.com/publish/post/157757913">network biomarkers</a>, a conceptual framework that treats biological relationships rather than isolated metrics as the fundamental unit of health assessment. </p><p>Consider what this might mean for the biological age measurement that so captivated my friend. Instead of trying to compress the complexity of multi-system aging into a single score derived from heart rate and sleep patterns, imagine a distributed sensor network that treats aging as what it actually is: a heterogeneous, organ-specific, process with complex interdependencies. Such a system might integrate data from a smartwatch or ring, continuous glucose monitor, muscle oximeter, smartphone sensors (including the camera and microphone), and periodic laboratory biomarker panels. Rather than generating disparate data streams, these inputs would feed into models designed to map causal relationships and treat the network itself as the biomarker. </p><p>Additionally, by collecting continuous digital biomarkers alongside periodic molecular measurements, we can map the causal relationships between different layers of biological organization, creating a fundamentally different paradigm for personalized health monitoring than what exists today<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>. For example, we could build causal models that map how changes in your diet, exercise, and daily behaviors impact sleep quality, specific inflammatory markers, and digital biomarkers like HRV (and how these things in turn impact each other, mapping the various circular dependencies and chains of cause-and-effect). </p><p>The practical applications of this approach are many. Imagine you notice your heart rate variability declining over several weeks coincident with changes in sleep architecture, more volatile blood glucose measurements following meals, and mild muscle tension dysphonia sensed via your iPhone&#8217;s microphone. Traditional approaches might flag this as "low readiness", "high strain", or some other vague wellness claim. A network biomarkers approach, on the other hand, might recognize this pattern as a potential signal of infection or systemic inflammation and recommend targeted laboratory testing for specific inflammatory cytokines, such as TNF-&#945;, IL-1, and IL-6, not only allowing for it&#8217;s predictions to be validated, but further feeding data into the causal models. Perhaps more importantly, by mapping causal chains between all of these variables, and the relative strength of those causal relationships, we can determine the most impactful interventions, which is something I&#8217;ve written about in <a href="https://sequenceanddestroy.substack.com/p/breaking-biometric-babel">Breaking Biometric Babel</a>. </p><p>This also seems to align with Oura's thinking, as indicated by Maz Brumand, their VP of product, who told CNET: "By combining Stelo data with Oura's existing insights, we're empowering members to better understand the cause-and-effect relationships between eating patterns, energy, mood and recovery and ultimately make sustainable, science-backed lifestyle changes."</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Seeing one of the biggest players in the industry embrace the aforementioned approach validates what should be clear: that the measurement gap that characterizes today's consumer health landscape isn't immutable. It's the predictable result of trying to extract complex biological insights from individual sensors measuring isolated parameters. However, as the industry shifts from single-point sensing to distributed sensing networks and network biomarkers, the door to genuine biological insights opens wide.</p><p>This transition will likely create a bifurcation in the market. Companies that continue building increasingly elaborate castles on quicksand&#8212;layering algorithmic complexity onto fundamentally inadequate sensing&#8212;will find their structures collapsing under their own weight as consumers become more sophisticated about what these devices can and cannot measure. Meanwhile, those embracing distributed sensing will capture the growing segment of users who want real, actionable, health insights. </p><p>The question isn't whether this shift will happen, but how quickly the industry will recognize that solid ground lies not in more clever algorithms, but in acknowledging the inherent complexity of human biology and building sensing architectures that match that complexity.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>According to Whoop, "Whoop Age is a measure of your physiological age, which can be younger or older than your actual, chronological age." On <a href="https://support.whoop.com/s/article/Healthspan-WHOOP-Age-Pace-of-Aging-Guide?language=en_US">Whoop&#8217;s support page </a>they go on to explain how Whoop Age is calculated, stating "Whoop analyzes the following key metrics to determine your Whoop Age: Sleep, Strain, Fitness (Resting Heart Rate, VO2 Max, Lean Body Mass (if available).&#8221; In other words, Whoop claims to predict your age based on your sleep (loosely correlated with age), Strain (a made up measurement), resting heart rate (poor predictor of age within their target demographic), VO2 max (which their device can&#8217;t measure), and lean body mass (another metric they don&#8217;t measure)... seems legit. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Not to mention the fact that even the most precise aging clocks, utilizing proteomic data from blood or tissue biopsies, are only accurate at the organ-specific level. That is to say,  the idea of a single "biological age" is unfounded in that different organs age at different rates, which could be uncouple from one another in the case of certain disease states. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p> I&#8217;ve written about the dichotomy between form-first and function-first wearable technology development in <a href="https://sequenceanddestroy.substack.com/p/the-garden-of-technological-possibilities">The Garden of Technological Possibilities</a>. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Interestingly Whoop failed to mention the lack of clinical benchmarks or validation for their aging measurements in their marketing and the 42-page product release document titled <a href="https://assets.ctfassets.net/rbzqg6pelgqa/2e36t5aQhhrNmsUgpBtDT4/589dba62ee24e29d1093c600cbb4faf9/WHOOP-2025-White-Paper-Healthspan-RND4.pdf">"The Whoop Healthspan Feature: Advancing Personalized Longevity</a>". Instead, it was briefly mentioned on their less publicly facing <a href="https://support.whoop.com/s/article/Healthspan-WHOOP-Age-Pace-of-Aging-Guide?language=en_US">support page</a>. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Doing so requires incorporating time-dependent relationships in multiomics data, implementing methods to deconvolute cause-and-effect relationships, and building systems that allow for easily mapping between different forms of biological data, from digital biomarkers to proteomic measurements, to outcomes, which i&#8217;ve written about <a href="https://sequenceanddestroy.substack.com/p/multiomics-data-in-4d">here</a>, <a href="https://sequenceanddestroy.substack.com/publish/post/171942524?back=%2Fpublish%2Fposts%2Fscheduled">here</a>, and <a href="https://sequenceanddestroy.substack.com/p/computational-strategies-for-mapping">here</a> respectively. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #58 // How to Kill a Tumor]]></title><description><![CDATA[Three perspectives on rationale drug selection in personalized cancer therapy]]></description><link>https://sequenceanddestroy.substack.com/p/issue-58-how-to-kill-a-tumor</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-58-how-to-kill-a-tumor</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Sun, 30 Nov 2025 14:03:11 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/1a7ca196-e84f-4b31-8a03-317d9e1420ab_1456x852.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Did you enjoy this issue?</strong> If so, you can show your support by tapping the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. Thanks!</p><div><hr></div><h2>Issue &#8470; 58 // H&#248;w to Kill a Tum&#248;r</h2><p>I was staring at a series of protein heat maps when I realized I had a problem. Not the usual <em>SyntaxError</em> problem. My code ran to completion and the visualizations it generated looked fine. The problem was philosophical. Depending on which analytical lens I applied to the same clinical trial dataset different proteins appeared to be the "best" therapeutic target. The first protein that caught my attention was EGFR T1068, showing a clean eight-fold up-regulation in tumor samples compared to adjacent healthy tissue, with pathway enrichment analysis showing the activation of downstream growth signaling cascades. Kaplan-Meier and Cox hazard analysis even showed EGFR T1068 to be a strong negative prognostic marker in the trial&#8217;s patient population. This looked like textbook oncogene addiction, where cancer cells had become dependent on hyperactive EGFR signaling for their survival and proliferation. The verdict seemed clear: inhibit EGFR, break the addiction, eliminate the cancer. </p><p>But, when I shifted my focus to surface proteomics data, searching for potential antibody-drug conjugate (ADC) targets, a different protein stood out. TROP2, also known as TACSTD2, showed high tumor-specific expression despite having no obvious role in driving malignant transformation in this specific context. Yet, it was abundantly present on tumor cell surfaces, and had excellent internalization kinetics, making it an ideal postal address to deliver cytotoxic payloads to. The logic for drugging<strong> </strong>TROP2 was fundamentally different from EGFR. TROP2 didn&#8217;t need to be a driver of cancer; it just had to be present, accessible, and able to carry a lethal package across the cell membrane. </p><p>Network topology analysis revealed yet another drug target candidate entirely. A protein called NEDD9 consistently showed up as a high degree and betweenness centrality hub in tumor co-expression networks while maintaining low centrality in normal tissue networks. NEDD9 wasn't dramatically over-expressed, showing only a modest ~1.3-fold elevation compared to normal samples. But its position at the intersection of multiple signaling pathways made it a critical coordinator of cellular communication within the malignant "ecosystem". Disrupt NEDD9, the network analysis suggested, and you could trigger cascading failures throughout the tumor's organizational structure&#8212;like removing a critical router from a communication network. </p><p>Three different analytical approaches; three different answers for how to kill a tumor. Each one backed by strong statistical evidence and reasonable biological rationale, and each one representing a distinct philosophy about what makes cancer vulnerable to therapeutic intervention. Are these competing truths&#8212;or just different projections of, and perspectives on&#8212;the same underlying biology?</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>The multiplicity of tumor killing philosophies didn't emerge in a vacuum; it reflects the evolution of cancer biology itself. The driver-focused approach stems from the work of Bert Vogelstein and others, in the 1980&#8217;s and 1990&#8217;s, who showed that cancer progression follows predictable patterns of genetic alteration. Their discovery, that specific oncogenes like RAS and tumor suppressors like p53 are mutated across many cancer types, provided the conceptual framework for targeting the molecular engines of malignancy. This was the era when cancer research began to feel less like taxonomy and more like engineering. If we could identify the "broken parts" in a cancer cell, we could fix them, or at least jam them up enough to stop the disease. </p><p>This mechanistic worldview dominated cancer research for decades, leading to some amazing successes. When Brian Druker pioneered the research and clinical development of Imatinib (developed by Novartis) to target the BCR-ABL fusion protein in chronic myeloid leukemia, he wasn&#8217;t just advocating for a drug&#8212;he was validating an entire philosophical approach to cancer treatment<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Here was proof that if you understood the molecular driver of cancer and could drug it effectively, you could stop cancer in its tracks. The results were dramatic enough that some oncologists started talking about "functional cures" for a disease that had been a death sentence just a few years earlier. Yet, even as targeted therapies revolutionized treatment for specific cancer subtypes, researchers were discovering the limitations of pure driver-focused approaches. Cancer cells, frustratingly, have the ability to develop resistance through secondary mutations and pathway rewiring. The very genetic instabilities that create driver genes and proteins also enable escape mechanisms that render them undruggable. </p><p>The use of antibody-drug conjugates (ADCs) emerged partly as a response to these limitations. If cancer cells can adapt and evade our mechanistic understanding, perhaps it warrants a shift from disrupting their biology to simply delivering cytotoxic payloads with high precision. The philosophy here is both cynical and pragmatic&#8212;stop trying to understand what makes a cancer cell tick, and just find a way to selectively poison it.  This concept culminated in the development of compounds like Trastuzumab-Emtansine, combining the anti-HER2 antibody Trastuzumab with the cytotoxic agent Emtansine, where HER2&#8217;s role as a cancer driver becomes secondary to its utility as a giant bullseye on a tumor cell&#8217;s surface. </p><p>Meanwhile, the explosion of high-throughput sequencing in the early 2000s enabled systems biologists to construct comprehensive molecular interaction networks and identify emergent properties that weren't apparent from studying individual proteins in isolation. These researchers were asking fundamentally different questions. Instead of "which protein is dysfunctional?" or "which protein is abundant?", they asked "which protein, if removed, would cause the most widespread  systems failure?" Network-based approaches promised to reveal organizational vulnerabilities that traditional reductionist methods might miss&#8212;nodes whose disruption could cascade through interconnected pathways to achieve therapeutic effects that exceeded the sum of their molecular parts<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>The tension between these different tumor killing approaches became increasingly apparent as I dug deeper into the oncology literature. EGFR T1068, the clear winner from my driver analysis, has been targeted clinically for years with mixed results. While EGFR inhibitors like Erlotinib have proven to be efficacious in treating lung cancer, their performance has been less stellar in other forms of cancer, like breast cancer. Unfortunately, this happened to be the experimental condition I was working on. So, despite having strong statistical evidence suggesting a certain therapy, the underlying experimental system had too many feedback loops, compensatory pathways, or microenvironmental factors that could potentially blunt it&#8217;s impact. </p><p>TROP2, my ADC target candidate, represented a different set of trade-offs. Recent clinical trials with TROP2-targeting ADCs had shown promising early signals, but the approach carried its own risks. Any surface protein abundant enough to serve as an effective delivery address might also be expressed at low levels in healthy tissues, creating potential toxicity concerns. The elegance of identifying tumor-specific surface markers has to contend with the messy reality of off-target effects and normal tissue cross-reactivity. The pharmaceutical literature was full of ADCs that appeared promising in preclinical models but caused dose-limiting toxicities in patients because the "tumor-specific" marker turned out to be expressed in the liver, lung, or other tissue at levels just high enough to matter. </p><p>This made NEDD9, with its high network centrality, conceptually appealing, but making the jump from network topology to a rational therapeutic target required navigating somewhat uncharted territory. After all, despite there being a number of proteins that have been shown to be network hubs across various cancer types, few have been successfully targeted in the clinic. The problems with network-based drug targeting is partly technical, partly conceptual. First off, high centrality-proteins are often structurally challenging to drug (large, flat surfaces). Additionally, while we&#8217;ve perfected the mathematical frameworks for identifying important nodes in a biological network, we&#8217;re still learning how to translate graph theory-based metrics into therapeutic strategies. For example, a protein may have a high degree and betweenness centrality on paper, while simultaneously being functionally redundant in practice, with parallel pathways compensating for its loss. </p><p>Yet, rather than viewing the aforementioned challenges as failures of individual targeting philosophies, I&#8217;ve begun to see them as complimentary perspectives on the same problem, or rather, orthogonal measurement of tumor vulnerability. A protein can simultaneously serve as a mechanistic driver, a payload delivery target, and a network hub, with each property lending itself to different therapeutic opportunities. Additionally, proteins that score highly across multiple analytical frameworks may represent particularly attractive drug targets precisely because they offer multiple routes to cell killing. For example, if EGFR monotherapy could be circumvented through pathway rewiring&#8212;with a notable example being the T790M mutation in the EGFR receptor that leads Gefitinib resistance&#8212;what about a protein that was simultaneously a driver, abundant on the surface, and a network bottleneck? Resistance would require the tumor to solve multiple problems at once.</p><p>With this in mind, I&#8217;ve been thinking about the idea of a multi-dimensional vulnerability assessment for potential drug targets. Instead of choosing between driver analysis, surface targeting, and network disruption, what if we scored proteins across all three dimensions simultaneously? For example, we can identify candidate protein targets based on their differential expression, pathway involvement, or network centrality, then, for each candidate in the list, we can generate a normalized score for each of these criteria (which accounts for different scaling), then create a composite score that combines the three individual scores (where each component is weighted). The weighting itself becomes a tunable parameter&#8212;you might weight surface expression more heavily if you&#8217;re specifically interested in ADC development, or prioritize network centrality if you&#8217;re looking for non-obvious targets that conventional approaches might miss. While most proteins show strong signals in only one analytical framework, a small subset emerge as multi-dimensional targets with high scores across multiple approaches<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\text{Composite Score} = w_{DE}*\\text{Z-score(DE Score)}+w_{Network}*\\text{Z-score(Network Score)}+w_{Path}*\\text{Z-score(Pathway Score)}&quot;,&quot;id&quot;:&quot;IJVRCFPZUG&quot;}" data-component-name="LatexBlockToDOM"></div><p>When I implemented this scoring system on my clinical trial dataset, something interesting happened. The top-ranked proteins weren&#8217;t necessarily the ones with the most dramatic signals in any single dimension. Instead, they were proteins that performed consistently well across all three frameworks&#8212;showing moderate over-expression, reasonable surface accessibility, and meaningful network positions. These were the proteins that traditional single-lens analyses might overlook in favor of more exceptional, but one-dimensional, candidates.</p><p>Consider the example of HER2 (aka, ERBB2) in breast cancer&#8212;a protein that initially gained attention as a driver oncoprotein but proved equally valuable as an ADC target and network hub. HER2 overexpression clearly drives tumor growth through activation of downstream signaling cascades, but its abundant surface expression also makes it a viable target for ADCs like T-DM1. Additionally, network analyses consistently identify HER2 as a high-centrality node that coordinates multiple aspects of tumor biology as it sits at the intersection of the PI3K/AKT/mTOR pathway, the MAPK cascade, and multiple other signaling networks that cancer cells depend on for survival and proliferation.</p><p>Thus, the clinical success of HER2-targeting agents may derive not just from inhibiting a key driver pathway, but from simultaneously disrupting a critical network node and providing a reliable delivery address for cytotoxic payloads. This multi-dimensional vulnerability may explain why HER2-positive breast cancers, despite their aggressive biology and fast growth, often show excellent responses to targeted therapy combinations. We&#8217;ve been attributing HER2&#8217;s success primarily to its role as a driver oncogene, but perhaps we&#8217;ve been underselling the contribution of its other properties. The fact that you can both inhibit HER2 signaling with small molecules and deliver toxic payloads via HER2-targeting antibodies gives oncologists multiple tools to attack the same target, making resistance that much harder to evolve.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Instead of asking "which protein is the best drug target?" the question now becomes, "which combinations of vulnerabilities can we most effectively exploit, while simultaneously minimizing toxicity?" If tumors present multiple, orthogonal vulnerabilities, then therapeutic strategies that address these vulnerabilities simultaneously might achieve synergistic effects while reducing the likelihood of developing resistance. This reframing also suggests why some clinical trials fail despite strong preclinical rationale. A protein might score perfectly on one dimension&#8212;say, as a clear driver&#8212;but if it lacks the other properties that make it druggable or deliverable, the therapeutic window might be too narrow for clinical success.</p><p>By recognizing that different analytical approaches reveal different but equally valid aspects of tumor biology, we can move beyond the philosophical debates that have sometimes divided the cancer research community. The question isn't whether we should target drivers, exploit highly abundant surface proteins with cytotoxic payloads that can be internalized by the cell, or disrupt networks&#8212;it's how we can most effectively integrate these approaches to achieve better patient outcomes. </p><div><hr></div><h4>If you liked this post, you may enjoy the following pieces from Sequence &amp; Destroy&#8217;s Backlog:</h4><ul><li><p><a href="https://sequenceanddestroy.substack.com/p/machine-learning-the-native-language">Issue #36 // Machine Learning: The Native Language of Biology</a></p></li><li><p><a href="https://sequenceanddestroy.substack.com/p/molecular-moonlighting">Issue #55 // Molecular Moonlighting</a></p></li><li><p><a href="https://sequenceanddestroy.substack.com/p/issue-63-graphs-as-metaphor-in-biological">Issue #57 // Graphs As Metaphor In Biological Systems</a></p></li></ul><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I&#8217;d strongly recommend reading <a href="http://what">The Philadelphia Chromosome: A Genetic Mystery, a Lethal Cancer, and the Improbable Invention of a Lifesaving Treatment</a> which tells the story Imatinib (Gleevec) &#8212; the first true targeted therapy for cancer. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>I previously wrote about this concept in <a href="https://sequenceanddestroy.substack.com/p/leveraging-network-analysis-for-drug?utm_source=publication-search">Leveraging Network Analysis For Drug Target Discovery</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>What&#8217;s interesting about this approach, is that targets that show only modest differential expression, network centrality, and pathway involvement can still come out at top candidates when competing with proteins that may score highly in only one of these criteria. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Issue #57 // Graphs As Metaphor In Biological Systems ]]></title><description><![CDATA[Representing Biological Systems As Multi-Layered Graphs]]></description><link>https://sequenceanddestroy.substack.com/p/issue-63-graphs-as-metaphor-in-biological</link><guid isPermaLink="false">https://sequenceanddestroy.substack.com/p/issue-63-graphs-as-metaphor-in-biological</guid><dc:creator><![CDATA[Evan Peikon]]></dc:creator><pubDate>Tue, 18 Nov 2025 10:30:39 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/8d67e6e2-c12a-4edc-b91e-c4afe3569dc3_1430x840.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JRQY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png" width="1456" height="314" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:314,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:558245,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/170026527?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!JRQY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 424w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 848w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1272w, https://substackcdn.com/image/fetch/$s_!JRQY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbf112a-126a-4233-97fb-ca9486e8961a_2030x438.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p><strong>Liked this piece?</strong> If so, tap the &#128420; in the header above. It&#8217;s a small gesture that goes a long way in helping me understand what you value and in growing this newsletter. Thanks so much!</p><div><hr></div><h2>Issue #&#8470; 57 // Graphs As Metaph&#248;r In Bi&#248;l&#248;gical Systems </h2><p>There&#8217;s a particular kind of frustration that comes with studying living systems, one that doesn&#8217;t quite exist in physics or chemistry. You can spend years characterizing how a protein behaves in a carefully controlled experiment, publish your findings, and then watch as someone else discovers that same protein doing something completely different in another context. This anecdote isn&#8217;t personal; I've haven&#8217;t spent years studying a single protein, learning everything there is to know about it. But I have experienced a similar widening of my aperture as I went from undergrad where I learned that a protein&#8217;s structure determines its function to grad school where I learned about specific functions for a number of different oncoproteins to my doctoral research where I saw how protein function is highly context dependent, inherently relational, and not always entirely clear. </p><p>Along the way, I also learned to think about biological systems as graphs&#8212;networks of nodes connected by edges, which I wrote about at length in <a href="https://sequenceanddestroy.substack.com/p/computational-strategies-for-mapping">Issue #51 // Mapping Biology&#8217;s Dark Matter</a>. At the molecular level, the nodes might be genes, proteins, or metabolites. The edges represent their relationships: how they interact, regulate each other, compete for resources, are co-expressed, and more. Zoom out, and those molecular nodes aggregate into functional modules such as signaling pathways, molecular functions, and biological processes. Zoom out further still, and you&#8217;re looking at cells, tissues, entire organisms, and populations. It&#8217;s graphs all the way up.</p><p>Most molecular biology research, when you strip away the particulars, does one of two things: it identifies a new node, or it identifies a new edge. A novel protein gets characterized&#8212;that&#8217;s a node. Someone discovers that this protein phosphorylates another protein under certain conditions&#8212;that&#8217;s an edge. The problem, and the thing that makes biology uniquely maddening, is that we don&#8217;t know what all the nodes are, even for relatively well-studied processes. And even when we do know the nodes, mapping the edges between them is extraordinarily slow work.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>I&#8217;ve written before about <a href="https://sequenceanddestroy.substack.com/p/molecular-moonlighting">molecular moonlighting</a>, which is the phenomenon where a single protein performs multiple, often unrelated functions depending on context. It&#8217;s a good example of why the graph metaphor matters. If you study a protein in isolation, or even in a small set of pairwise interactions, you get an incomplete picture. You might tag it with a fluorophore, watch how it moves when you perturb the system, and conclude that it plays one specific role. But that role might be contingent on a dozen other factors you weren&#8217;t measuring: the presence of particular cofactors, the cell&#8217;s metabolic state, or what other proteins are competing for the same binding sites. Biology is full of higher-order feedback loops. A protein&#8217;s function isn&#8217;t intrinsic&#8212;it&#8217;s emergent from its position in the network.</p><p>The traditional experimental toolkit reflects these limitations<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. If you wanted to understand a biological process twenty years ago, you could really only look at a few nodes at a time. Fluorescent tagging let you watch a handful of proteins and  knockout models let you remove a node and observe the ripple effects. These approaches worked, in the sense that they produced real knowledge, but they were constrained by throughput. Designing an experiment that could simultaneously account for the behavior of dozens or hundreds of interacting components was essentially impossible. </p><p>The last fifteen to twenty years have been different. Next-generation sequencing technologies didn&#8217;t just make it cheaper to sequence DNA&#8212;they enabled a fundamentally different mode of inquiry. Suddenly you could measure the expression levels of thousands of genes in a cell at once. You could characterize tens of thousands of protein-protein interactions in a single experiment. RNA-seq, ChIP-seq, ATAC-seq, single-cell sequencing, spatial transcriptomics&#8212;each of these represents a way to observe many more nodes and edges simultaneously. The graph becomes denser, more complete.</p><p>What becomes visible at this scale are the things that pairwise studies miss. Redundancy, for example. Biological systems are full of backup mechanisms, parallel pathways that only become apparent when you knock out one component and discover that the system compensates. Or context-dependency; a protein that&#8217;s essential in one cell type might be practically irrelevant in another, not because the protein itself has changed, but because the surrounding network is different. These aren&#8217;t edge (<em>pardon the pun</em>) cases. They&#8217;re the norm.</p><p>The knowledge graph framework also clarifies what kinds of questions remain hard. Even with high-throughput tools, we&#8217;re still working with incomplete graphs. Some nodes are easier to observe than others&#8212;genes are straightforward, but post-translational modifications or transient protein complexes are slippery. Some edges are easier to infer: co-expression is a weak signal, but it&#8217;s measurable at scale. Physical protein-protein interactions are stronger evidence, but harder to capture comprehensively. Causal relationships&#8212;this protein activates that pathway, under these conditions&#8212;are the hardest of all, because biology resists clean causality<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. The graph is dynamic. The edges change depending on the state of the system.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZLMz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp" width="48" height="48" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:800,&quot;resizeWidth&quot;:48,&quot;bytes&quot;:14436,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://sequenceanddestroy.substack.com/i/179722635?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZLMz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 424w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 848w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1272w, https://substackcdn.com/image/fetch/$s_!ZLMz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ec7c5fa-d424-4c32-a649-1265b13f3d81_800x800.webp 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>There&#8217;s an intellectual style I admire, the kind George Church exemplifies, where technical rigor coexists with a willingness to make unexpected connections across disciplines. The graph metaphor invites this. Network theory was developed to study social systems, electrical grids, and the internet<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. The mathematics of centrality and modularity don&#8217;t care whether the nodes are people or proteins. This isn&#8217;t just analogy; there are real conceptual tools from network science that translate directly. The idea of network motifs, recurring patterns of connections that show up across different systems, has been productively applied to gene regulatory networks. The concept of scale-free networks, where a few highly connected hubs dominate, describes both the World Wide Web and protein interaction networks.</p><p>But the metaphor also has limits, and it&#8217;s worth being clear about them. Real biological systems aren&#8217;t static graphs. They&#8217;re dynamic, probabilistic, and context-dependent in ways that are difficult to capture in a single representation. An edge between two proteins might exist in one cellular state and disappear in another. The graph itself is a moving target. Still, as an organizing framework, it works. It lets us think clearly about what we know and what we don&#8217;t and it suggests where the next productive questions might be. The project of biology from this perspective is one of gradual graph completion. We&#8217;re filling in missing nodes, adding edges, and refining our understanding of which connections matter under which conditions. The pace has accelerated, but the work remains enormous. Even for the relatively well-characterized model organisms&#8212;yeast, worms, flies&#8212;the graphs are incomplete. For human biology, where the complexity is orders of magnitude higher, we&#8217;re still sketching the outline.</p><p>What feels different now isn&#8217;t just the speed of data generation, but the possibility of asking questions that were previously out of reach. If you can measure thousands of variables simultaneously, you can start to model how networks respond to perturbations. You can identify vulnerabilities&#8212;nodes whose removal would collapse an entire pathway. You can predict how a system might compensate for damage. This is the promise of systems biology, computational biology, network medicine: not just cataloging parts, but understanding how they fit together.</p><p>The frustration I mentioned earlier doesn&#8217;t go away. Biological systems remain irreducibly complex. But the graph gives us a way to think about that complexity that&#8217;s both rigorous and generative. It lets us map what we know, acknowledge what we don&#8217;t, and chart a course forward. The edges are still hard to draw, the nodes still incomplete, but at least we can see the shape of the problem.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I first started thinking about the ideas in this paragraph after reading Yuri Lazebnik&#8217;s <a href="https://www.cell.com/cancer-cell/fulltext/S1535-6108(02)00133-2?_returnURL=https%3A%2F%2Flinkinghub.elsevier.com%2Fretrieve%2Fpii%2FS1535610802001332%3Fshowall%3Dtrue">Can a biologist fix a radio?&#8212;Or, what I learned while studying apoptosis</a>, which makes clear the limitations of reducing biological networks to their component parts. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Mapping causal relationships also requires time-series data with a sufficient number of time points, and at the right temporal scale, which is relatively rare in molecular biology. I recently published an open source tool called <a href="https://github.com/evanpeikon/CausalEdge">CausalEdge</a>, which identifies causal influences in this type of data when it is available. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Interestingly, a number of the concepts I discussed in <a href="https://sequenceanddestroy.substack.com/p/information-theory-gets-to-the-heart">Issue #49: Information Theory Gets to The Heart of Biometric Analysis </a>were also first developed to study electrical grids and communication over phone lines. </p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://sequenceanddestroy.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Sequence &amp; Destroy by Evan Peikon is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p></div></div>]]></content:encoded></item></channel></rss>