<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[In One Lifetime]]></title><description><![CDATA[On practicing Vajrayana Buddhism. ]]></description><link>https://www.paullitvak.com</link><image><url>https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg</url><title>In One Lifetime</title><link>https://www.paullitvak.com</link></image><generator>Substack</generator><lastBuildDate>Sun, 14 Jun 2026 04:03:16 GMT</lastBuildDate><atom:link href="https://www.paullitvak.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Paul Litvak]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[phowa@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[phowa@substack.com]]></itunes:email><itunes:name><![CDATA[Paul Litvak]]></itunes:name></itunes:owner><itunes:author><![CDATA[Paul Litvak]]></itunes:author><googleplay:owner><![CDATA[phowa@substack.com]]></googleplay:owner><googleplay:email><![CDATA[phowa@substack.com]]></googleplay:email><googleplay:author><![CDATA[Paul Litvak]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The future of academic journals?]]></title><description><![CDATA[Why journals lose their grip when scientific claims become legible]]></description><link>https://www.paullitvak.com/p/the-future-of-academic-journals</link><guid isPermaLink="false">https://www.paullitvak.com/p/the-future-of-academic-journals</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Mon, 18 May 2026 17:49:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!y1Tv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Marx wrote that in full communism the state would just wither away, but he conveniently left out exactly how. A lot of writing on the future of scientific publishing is like that &#8212; pretty hand wavy when it comes to exactly how we transition into the glorious post-journal future. </p><p>The good news is that the transition is already underway! The reality, though, is that there is a lot yet that will need to happen to get from here to there. In this piece, I&#8217;m going to explain where we are on that transition, tell a detailed story of what that transition could look like, and consider some drawbacks and objections. My goal is to convince you both that this process is in some sense inexorable and that it actually portends something good for science and scientists. There&#8217;s obviously a lot of fear and perhaps a mounting backlash to AI among academics, some of which is for justified reasons. But I think we can genuinely get excited about the possibility of real reform of academic publishing. This is important because it not only affects academia, but it affects all of us downstream consumers of the research process.</p><h3>The present gestures toward the future</h3><p>To that end, there have been a lot of experiments on the future of publishing, from new journals, to full end to end systems that span research submission, review and dissemination. But the effects of these experiments have been modest.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> To understand why, you have to consider all of the different functions an academic journal fulfills: distribution, curation, certification and ultimately prestige. Alt-journals have largely been able to break the monopoly on distribution, curation, and to a lesser extent, certification. But journals have largely maintained a monopoly on prestige - most academics&#8217; careers and funding come from where they publish. In a world flooded by new publications, the brand of a top journal like Science or Nature is a powerful signal of quality. And unlike citations or other metrics of impact which take time to accumulate, publication in a top journal confers instant credibility. For most of academia, grants and tenure can be determined by just a few such papers. </p><p>Of course, one alternative approach is to change how tenure and promotion are conferred to deemphasize brand name journal publications. There are experiments like this underway, notably in the University of Maryland psychology department. This is terrific, but changing the hiring and award policies in each one of hundreds of thousands of academic departments is an uphill battle that will likely take a long time.</p><p>Free journals are an obvious lever that do exist and compete with the for-profit journals. They have made some headway, but coverage by field is patchy and the for-profit journals have still maintained most of their brand advantage. It&#8217;s possible that the brand value of journals will rapidly erode as AI submissions dilute the quality of journals. This could happen, though it&#8217;s likely that journals will do something about this, like augment peer review with AI triage. Or they might try to use AI text detectors (like Pangram) to filter out AI submissions. I doubt that will even partially stem the tide of submissions, though.</p><p>Another approach is to change the model of academic publishing. To date, there have been three kinds of approaches to replacing the prestige part of the traditional journal bundle: the overlay journal approach, the conference model, or relying on post-publication metrics. </p><p>In the overlay approach (e.g. Discrete Analysis, eLife reviewed preprints) they add peer review on top of public access repositories, unbundling curation and certification from distribution. Yet <em>Annals of Mathematics, Inventiones, JAMS</em> remain hugely prestige-laden for tenure. Overlay journals are a small fraction of math publishing and grow slowly. It&#8217;s possible this approach will crowd out for-profit journals over time, but for now the best one can hope for is that these overlay journals can become one alternative.</p><p>In the conference model, most prominent in computer science, conference acceptance and best paper awards are used to confer prestige and tenure. This has its own pathologies - deadline-driven half finished work, and a host of tactics to game the submission system, etc. But more importantly, in experimental sciences where it can take years to gather data to produce evidence for a claim, the short timeline of a conference model wouldn&#8217;t work. It&#8217;s also worth noting that all peer reviewed approaches in general requiring free labor are at risk from AI generated submissions, reviewer burnout etc. An ideal system would find a way to pay for good reviewers!</p><p>Finally, in the post-publication metrics approach,  metrics are used to sort research and researchers. Examples include altmetrics, which relies on &#8220;attention&#8221; broadly construed, or the venerable h-index, which relies on citations. By and large academics hate these. Citation metrics, in particular, have become gamed significantly, with citation collusion rings being discovered regularly. H-index, also, of course, still relies on the paper as the unit of scientific currency. Metrics have had some impact on how scientists are judged, but everyone would say they&#8217;ve been corrupted. The last thing we need is to reduce scientists to a few incomplete and corruptible metrics.</p><p>And yet, I think AI-enabled metrics computed over a structured representation of scientific knowledge will be part of the answer&#8230;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!y1Tv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!y1Tv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 424w, https://substackcdn.com/image/fetch/$s_!y1Tv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 848w, https://substackcdn.com/image/fetch/$s_!y1Tv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!y1Tv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!y1Tv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg" width="500" height="750" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:750,&quot;width&quot;:500,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!y1Tv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 424w, https://substackcdn.com/image/fetch/$s_!y1Tv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 848w, https://substackcdn.com/image/fetch/$s_!y1Tv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!y1Tv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F60efbac8-9cfc-4d9b-9b46-49bf4f52b042_500x750.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3>A proposed path</h3><p>Here&#8217;s the idea. We use AI to parse scientific research into atomic claims arranged in a nomological network. A nomological network is a way to represent scientific theories - the entities they operate over (the &#8220;ontology&#8221;) as well as their observable manifestations (the &#8220;operationalization&#8221;) and the relationships between them. The representation would also include the scope for the claim (e.g. how far it generalizes) and a list of auxiliary assumptions. The entities of this network can be things like &#8220;self-control&#8221;, &#8220;inflation expectations&#8221; or &#8220;ribosome&#8221;. For something like self-control, for example, we might have a number of different operationalizations, which could include a neurological signature or a survey instrument. The relationship between the entities and how we measure them is itself a claim with evidence. There are different kinds of claims one can make relating entities, like &#8220;Depression causes sleep disruptions&#8221; or &#8220;These gene SNPs correlate with educational attainment.&#8221; </p><p>Each claim also carries a posterior that reflects current evidence. And the system keeps track of provenance - i.e. which studies, how they were designed, and the various checks the evidence has passed (e.g. computational reproducibility). Moreover, as knowledge grows and evolves, the graph should too. Scientists (and AIs) could propose an entity splitting in two, merging (e.g. ego depletion is just fatigue!) proposing a new construct, contesting an operationalization, etc. And ultimately it&#8217;s the community of scientists, aided by powerful AI systems, that will govern<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> and decide how this system reflects scientific consensus, or the lack thereof. </p><p>Producing something like this at scale, let alone keeping it up to date, would have been an enormous and massively expensive task to do in the past. But with current AI systems and the promise of even more capable future AI systems, building something like this is possible<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. Humans can and likely will (especially at first) contribute to overseeing the creation of the overall ontology, as well as contributing much needed evaluation to ensure it&#8217;s correct. In fact, reading the foundational literature of a scientific field and then carefully making sure it&#8217;s well represented in this system is just the kind of task that would be useful for a beginning grad student to do.</p><p>Then the next step is to be able to quantify the contributions to science by changes to this graph over time. Once you create such metrics, such a system could attribute incremental improvements to our understanding to the scientists who caused it. Here&#8217;s how that could work, over time, to replace the scientific journal as a record of contribution for scientists. Each stage brings in a new set of users: research scientists first, then AI systems and policy makers, then everyone downstream of better-funded and better-targeted science.</p><h4>Stage one: living layers form alongside journals</h4><p>Initially, the system starts as a tool which allows research scientists to assess the scientific literature. They upload a set of published papers they locate via a search engine, and then the system extracts the claims, applies forensic audit, runs computational reproducibility where data is available, and outputs a claim-level posterior. The scientist can set a search that automatically updates the analysis when new papers come in. During this time, it&#8217;s mostly used by scientists and investigators, who are able to use the evidence aggregation and forensic tools to quickly assess a scientific literature. They are willing to use it despite its flaws, and in doing so, help provide the feedback and data needed to make LLM-based claim extraction highly accurate. They use it mainly to expose poor quality evidence and publish meta-analyses of their own. Some of these become the first living evidence systems. Therefore, at first, the coverage of this living layer is patchy; limitations in AI quality, limitations in funding, and limitations in human bandwidth to evaluate model outputs mean that it may take a few years for this system to gain momentum and coverage. Meanwhile, coordinating efforts, injections of philanthropic funding, and enabling technologies (e.g. turning PubMed into structured data AIs can accurately ingest) make investments in this burgeoning technology and employ underemployed researchers to help expand coverage of the system. During this time, journals continue to certify individual papers. </p><p>The turning point is when the review tool passes a threshold where it&#8217;s almost entirely accurate. At that point, an investigator makes their first set of major discoveries. A major area of research is found by the AI system to be unreliable or fraudulent, despite dozens of publications in top journals. Everyone takes notice. All of a sudden, the journal brand has been dented - the reliability of research is evaluable outside the traditional journal system. People start to have doubts about the journals. At this point, the system starts to really take off, and interest in scaling it starts to grow. More funding comes in, from governments, philanthropies, and AI companies themselves, who finally realize this is a valuable source of training data for them.</p><h4>Stage two: certification shifts toward provenance and audit</h4><p>As these evidence syntheses scale and become authoritative, certification of research quality follows naturally. More studies pass checks, and platforms (like <a href="https://ascollected.org/">asCollected</a>, which recently launched) provide data provenance verification to certify they were collected legitimately. Increasingly, the entire scientific workflow is instrumented and recorded, so the entire research process is logged, tracked and certified. As the system becomes more authoritative, its mistakes are increasingly rooted out quickly by the scientists themselves, who are incentivized to have their work analyzed correctly. So misunderstandings are smoothed out, and the system&#8217;s accuracy increases further.</p><p> In addition, organizations of scientists decide to aggregate their own expertise, enabling voting and a way for anyone to see scientists&#8217; consensus on claims<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. The system and its metrics continue to evolve as scientists&#8217; opinions change. The system is administered by a non-profit and is charged with making sure the metrics aren&#8217;t gamed. Constituted by forensic data scientists and meta-scientists, this organization is charged with measuring and maintaining the way results and contributions are certified. This is meant to resist, for example, review cartels. The audits, and the evidence aggregation mechanisms themselves continue to improve as scientists propose new ways to measure how science is done.</p><p>The evidence for atomic claims is certified by surviving forensic audit, computational reproduction where applicable, and integration into the living evidence graph with appropriate weighting. Good studies sway the balance of evidence more than bad studies. A small number of scientists start getting jobs and promotions based on their contributions to the research graph rather than traditional publications. Once that starts happening, other scientists really stop and take notice. Scientists start relying upon these systems more and more as the system gets more comprehensive. The more comprehensive this is, and the more certification it provides, and the more it&#8217;s relied upon, the more incentive journals have to participate. They need to include their articles, even if including their papers erodes the articles&#8217; value as scientific artifact. Nonetheless, journals continue to play a role for narrative synthesis, theoretical work, and complex multi-claim research. They stop being the primary certification layer for atomic empirical claims, however.</p><p>As the system grows, the evidence layer becomes a direct input into AI training systems. Understanding the most up to date scientific knowledge becomes essential for both humans and AI scientists to quickly devise the best follow-on experiments. Meanwhile, the structuring of policy analysis has a dramatic effect on how government and philanthropic funds are deployed. It becomes very easy to understand the most up to date evidence for different global health interventions, speeding the spread of life-saving changes. Education policy improves with a clearer read on the data.</p><h4>Stage three: incentives realign and many journals wither</h4><p>Once there is a trusted self-updating evidence layer which can track and assign credit for scientific progress, it becomes increasingly common for scientists to directly submit their work to the evidence layer, bypassing traditional journal articles. At first, the amount a paper shifts the posterior on a named claim becomes a measurable and attributable way of rewarding scientists. But soon, more methods for measuring the impact of scientists proliferate. Some scientists make conceptual clarifications, while others make methodological improvements that clarify causal inference across dozens of empirical datasets at once. Others collect quality data that comprehensively adjudicates between competing theories<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>.  There are so many different actions that are seen as furthering human and machine understanding. Scientists are rewarded for all of it. As researcher evaluation moves toward more holistic forms, these metrics become key contributors.</p><p>At this point, textbooks start being created automatically from these living meta-analyses. Public-facing explainers emerge alongside them and the living meta-analyses now provide the context layer for search engines and for journalists writing news stories. The general public becomes better informed, and it&#8217;s easy for anyone to see what the latest research suggests people should do in order to improve their health.</p><h3>Caveats and complications</h3><p>There are a number of boundary conditions on this vision, of course &#8212; some types of scientific work aren&#8217;t amenable to this kind of structured decomposition. As always, there are legitimate fears that we will accidentally disincentivize some key aspect of scientific practice not legible to the machine. But I want to resist the idea that because metrics are imperfect that we shouldn&#8217;t create them, or frankly that it&#8217;s possible to envision a world without them. We need some basis for selection, whether it&#8217;s grants or jobs<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> - the overthrow of metrics would only lead us to a world that prioritizes who you know &#8212; networking and nepotism<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a>. Moreover, the more quickly scientists work via AI, the more telemetry and data artifacts will be created. Ultimately, that may make the creation of these new metrics inevitable.</p><p>Of course it&#8217;s possible that a much worse version of this plays out. First and foremost, journals could end up co-opting and bankrolling the creation of these systems, using their massive catalogue of papers. They could steamroll any attempts to release public data, and win fair use cases that allow them to maintain their monopoly on the production of knowledge. It would be unfortunate if the infrastructure of science were privately owned. It should be a public good and ultimately maintained by scientists. </p><p>Another risk is that this AI system is opaque, full of subtle flaws that make it unreliable upon close inspection. This is why human collaboration is key. Especially at first, we&#8217;ll need the system to have confidence scores, to have humans audit high-stakes nodes, and for all the system&#8217;s interpretations of papers to be transparent and auditable. It&#8217;s crucial that the system be as open as possible to foster trust. This is another reason why the institution that maintains this system should be a non-profit entity.</p><p>It&#8217;s also possible that we end up with a system that fails to incentivize researchers properly, and ends up with more fruitless gaming. Goodhart&#8217;s law is a reality, but that doesn&#8217;t mean that all metrics are bad everywhere always. There are better and worse uses of metrics. Goodhart&#8217;s law isn&#8217;t solvable, but it can be mitigated through the correct institutional structures.</p><h3>Two cheers for metrics</h3><p>To mitigate  co-option or gaming, metrics need to be flexible and support human judgment. To that end, I want to propose three principles to guide the development of new metrics for valuing scientific contributions.</p><h4>Values made explicit, and tunable</h4><p>Metrics are always a reflection of what we value<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a>. Those values are often implicit: &#8220;we value writing papers which get cited a lot.&#8221; Goodhart&#8217;s law still holds; but if we make those values explicit and open, then at least we have the possibility of creating better, more responsive metrics. There are many different scientific values we could care about: novelty, risk-taking, rigor, methodological improvements. All of these and more can be operationalized and included. </p><h4>Metrics must continue to evolve and respond</h4><p>The conservatism of academia is such that the few metrics they have are completely frozen. This is a major problem. Consider an alternative model for how metrics can be responsive to efforts to game them: Google search. The Google search ranking algorithm continues to change as researchers find ways to improve it, but also in response to efforts to game it. We know approximately the kinds of signals that Google uses to rank, but we aren&#8217;t 100% sure. The real but bounded transparency balances the need for people to be incentivized to create good content for Google to rank with the need for secrecy so that the algorithm can&#8217;t be readily reverse engineered. For instance, some of the features that enable sleuths to detect fraudulent papers might need to be secret for a while so that we can better use them. Eventually, those become public and (unfortunately) bad actors will adapt. As stewards of the scientific enterprise, we&#8217;ll have to adapt too. Moreover, Google runs experiments to test changes to its ranking - there is a meaningful control. Decisions which hinge on these metrics, like grant decisions, are well served by some decisions made by an alternative procedure for comparison. This is the model we should use when thinking about how to create metrics around the scientific process.</p><h4>Metric-informed, not metric-driven decision-making</h4><p>Finally, metrics become tyrannical once they become full substitutes for human judgment. There are still &#8220;broken leg&#8221; cases. This is a term Paul Meehl used to describe rare circumstances that a human knows are important but sit outside a predictive model. Imagine an actuarial model is designed to predict if a professor will go to the movies on any given night. The model predicts a 90% probability based on historical data. However, the expert intuitively knows the professor just broke their leg. The expert intuitively adjusts the probability to 0%. The bureaucratic nightmare is one where it&#8217;s obvious to any human with a brain and a heart that the rules do not apply, and yet the bureaucrat doesn&#8217;t make an exception. To be data-informed means to use the data as a tool and not to be a tool of the data. Of course, Meehl&#8217;s point was that humans call for exceptions far more than is warranted, so of course there&#8217;s a balance.</p><h3>Conclusion</h3><p>Despite reasons for concern and pessimism, there is a real prize for humanity to be won if we steward this carefully. A claim level living evidence layer will earn scientists&#8217; and the public&#8217;s trust as it improves. As it grows in scope and quality, the clarity it brings as to the most trustworthy science will start to shift academic culture. If we build it correctly, we can solve a problem that is only growing in importance with the advent of generative AI. Namely, how to use AI to safely expand the research enterprise, ensure high quality scientific work, and credit the best research and researchers. The current journal system is roughly sixty years old in the form we know it. It survived the move from print to web, from subscription to open access. It probably won't survive the move from paper-as-unit to claim-as-unit, because that move dissolves what the journal was actually doing. The state withers, in this case, because something better is doing its job.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><h3></h3><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Physics and arxiv has been one partial success story, but physics journals still confer prestige. In computer science you have conference proceedings, but so it&#8217;s on a different timescale, but the basic way it works is similar.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Governance, voting, mechanism design, aggregation of knowledge, membership and identity verification. These are all extremely complex issues that I&#8217;m mostly punting on. </p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>In my <a href="https://www.paullitvak.com/p/we-can-create-the-future-of-science">previous essay</a>, I outlined a concrete proposal for starting this in one narrowly useful domain - randomized control trials.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>I&#8217;m not an expert on mechanism design, but there are a ton of interesting ways to aggregate preferences, e.g. quadratic voting, Bayesian truth serum, that could be worth considering. There are a lot of very smart economists who think about these kinds of questions who I&#8217;m sure will have great ideas.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>When I consider all the types of theories, I always think about Murray Davis&#8217; <a href="https://proseminarcrossnationalstudies.wordpress.com/wp-content/uploads/2009/11/thatsinteresting_1971.pdf">That&#8217;s Interesting</a>, who created a very handy classification for theoretical moves. Some of these lend themselves cleanly to graph operations, while others less so because they imply more value judgments:</p><p><em>Single phenomena:</em></p><p>(i) <strong>Organization.</strong> (a) What seems disorganized is organized. (b) What seems organized is disorganized.</p><p>(ii) <strong>Composition.</strong> (a) What seems heterogeneous is composed of a single element. (b) What seems unitary is composed of heterogeneous elements.</p><p>(iii) <strong>Abstraction.</strong> (a) What seems individual is holistic. (b) What seems holistic is individual.</p><p>(iv) <strong>Generalization.</strong> (a) What seems local is general. (b) What seems general is local.</p><p>(v) <strong>Stabilization.</strong> (a) What seems stable is unstable. (b) What seems unstable is stable.</p><p>(vi) <strong>Function.</strong> (a) What seems to function ineffectively functions effectively. (b) What seems to function effectively functions ineffectively.</p><p><em>Multiple phenomena:</em></p><p>(vii) <strong>Evaluation.</strong> (a) What seems bad is good. (b) What seems good is bad.</p><p>(viii) <strong>Co-relation.</strong> (a) What seem unrelated are correlated. (b) What seem related are uncorrelated.</p><p>(ix) <strong>Co-existence.</strong> (a) What seem compatible are incompatible. (b) What seem incompatible are compatible.</p><p>(x) <strong>Co-variation.</strong> (a) What seems positive co-variation is negative. (b) What seems negative co-variation is positive.</p><p>(xi) <strong>Opposition.</strong> (a) What seem similar are opposite. (b) What seem opposite are similar.</p><p>(xii) <strong>Causation.</strong> (a) What seems the cause is the effect. (b) What seems the effect is the cause.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Maybe if we end up with fully automated luxury communism, everyone can be a yeoman researcher and grants will be as plentiful as gumdrops. For now though, the competition for jobs and grants remains.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Robyn Dawes was known for his work on the many problems in human interviews as a screening process. Without metrics, we&#8217;d have to rely even more on the biases of human judges.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Thanks to Katie Corker for suggesting this.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Grant-making is broken, trauma as trapped priors, pointers to non-duality]]></title><description><![CDATA[Links 5/15]]></description><link>https://www.paullitvak.com/p/grant-making-is-broken-trauma-as</link><guid isPermaLink="false">https://www.paullitvak.com/p/grant-making-is-broken-trauma-as</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Fri, 15 May 2026 15:38:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Kevin Munger in another smart piece, points out that the grant-making system is already breaking due to AI. Now is an opportunity to design something better, an opportunity so far, that European grantmakers are not taking. Instead they are imploring researchers to slow down and apply for fewer grants. But why would they when the incentives point in the opposite direction? <a href="https://kevinmunger.substack.com/p/jubilee-for-dysfunctional-institutions">Jubilee for Dysfunctional Institutions &#8212; Kevin Munger</a></p></li><li><p>Max Shen writes on trauma and the mind-body connection. This short piece argues that it&#8217;s better to think about trauma as a trapped prior / prediction rather than some kind of stored pain, but that Kotler et al miss the fact that these priors can be trapped in the body, and not just the brain.<a href="https://substack.com/home/post/p-196372643">Contra Kotler, Friston et al. on &#8216;The body keeps the score&#8217; &#8212; Max Shen</a></p></li><li><p>Matthew Gindin on the failings of Western Advaita communities. The key takeaway for me is not to view statements about non-duality as philosophical doctrine so much as they are experiential pointers. People (i.e. lots of people who &#8220;practice&#8221; Advaita) who view this as purely a philosophical position are missing the point. <a href="https://matthewgindin.substack.com/p/why-is-the-western-nonduality-scene">Why Is The Western Nonduality Scene So Nuts? &#8212; Matthew Gindin</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[AI access inequities, bad philosophical inquiries, and research iniquities]]></title><description><![CDATA[Links 5/14]]></description><link>https://www.paullitvak.com/p/ai-access-inequities-bad-philosophical</link><guid isPermaLink="false">https://www.paullitvak.com/p/ai-access-inequities-bad-philosophical</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Thu, 14 May 2026 15:54:36 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Anton Leicht makes a strong case that we need be worried about differential access to frontier models as an equity issue. This is a strong case for building as many data centers as we can. Although this is counter to much of the anti-AI discourse, this seems right to me. Though given the increasing security risks, it seems obviously true that widespread availability of frontier models is ending.<a href="https://writing.antonleicht.me/p/cut-off">Cut Off &#8212; Anton Leicht</a></p></li><li><p>Kasra with a great piece on why philosophical training can make you easier to fool: skill at constructing and deconstructing arguments doesn&#8217;t track truth without introspective attunement to the emotional motivations behind belief. My uncle calls this phenomenon being &#8220;too smart to learn.&#8221; <a href="https://www.bitsofwonder.co/p/harder-to-be-fooled-easier-to-fool">Easier to fool &#8212; Kasra</a></p></li><li><p><em>The Scientist</em> surveys eight research-integrity stories from 2025 &#8212; I hadn&#8217;t even heard of most of these stories, and this is a topic I&#8217;m following closely on a regular basis! <a href="https://www.the-scientist.com/inside-the-scientific-community-s-research-integrity-crisis-74391">Inside the Scientific Community&#8217;s Research Integrity Crisis &#8212; The Scientist</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[A tribute to a mother, spiritual bypassing, and better DEI]]></title><description><![CDATA[Links 5/13]]></description><link>https://www.paullitvak.com/p/a-tribute-to-a-mother-spiritual-bypassing</link><guid isPermaLink="false">https://www.paullitvak.com/p/a-tribute-to-a-mother-spiritual-bypassing</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Wed, 13 May 2026 16:00:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Jose Luis Ricon wrote this touching tribute to his mother and the experience of sitting at his mother&#8217;s deathbed. I was especially moved by how he shared his self acceptance with his mother and the deep gratitude for her that the piece is suffused with. <a href="https://nintil.com/mom/">There was only mom &#8212; Jose Luis Ricon</a></p></li><li><p>Bill Klein on a Kegan-shaped misreading inside meditation circles: the &#8220;cool, unaffected&#8221; stance that practitioners often pursue is usually dissociation (spiritual bypass I dare say?). Defensiveness, sensitivity, and overwhelm can be evidence of a pending stage transition rather than failure to be enlightened. The move is &#8220;<em>aligning ourselves with embodied awareness</em>: opening fully to emotion and sensation, without interpretation, in relational context, even when it&#8217;s excruciating.&#8221; <a href="https://logidelic.substack.com/p/kegans-3rd-order-of-mind">Kegan&#8217;s 3rd Order of Mind &#8212; Bill Klein</a></p></li><li><p>Rachel Kleinfeld: current DEI programs may be increasing the very divisions they claim to address &#8212; a summary of the empirical literature on backlash and a mechanism-aware redesign. I realize this is a radioactive topic, but I really appreciated Kleinfeld&#8217;s piece. It admitted problems, lauded the overall intent and goals of DEI, and reoriented the question toward how to best achieve those goals. I love the spirit of this: &#8220;One form of rigor that is particularly essential to doing diversity differently is an end to trigger warnings and the fear of identity harm they entail. Diversity courses should help students get curious about one another and about themselves&#8212;not shut down questioning as too sensitive or hurtful.&#8221; <a href="https://www.persuasion.community/p/how-to-fix-dei">How To Fix DEI &#8212; Rachel Kleinfeld</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Self-reckoning, multiverse analysis, aphantasia]]></title><description><![CDATA[Links 5/12]]></description><link>https://www.paullitvak.com/p/self-reckoning-multiverse-analysis</link><guid isPermaLink="false">https://www.paullitvak.com/p/self-reckoning-multiverse-analysis</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Tue, 12 May 2026 15:56:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Jeff Giesea, reviewing Alison Espach&#8217;s <em>The Wedding People</em>, on periodic self-reckonings &#8212; what it looks like to accept a truth about yourself you&#8217;ve been avoiding. My mid-life has been a constant (sometimes unpleasant) process of coming face to face with all my insanity and shadow and somehow being ok with it. <a href="https://jeffgiesea.substack.com/p/you-need-to-come-out-to-yourself">You need to come out to yourself right now &#8212; Jeff Giesea</a></p></li><li><p>Kurtis Hingl on AI enabling multiverse analysis of scientific papers in the (approaching?) post-PDF era. Beyond sampling and design-based uncertainty, there&#8217;s a third kind &#8212; the <em>non-standard error</em>, or what would have happened if a different researcher had made different defensible choices at each node of the analytical pipeline. Hingl proposes that AI can map the path-space cheaply enough to actually report it. <a href="https://hunchbox.substack.com/p/science-is-still-persuasion">Science is still persuasion &#8212; Kurtis Hingl</a></p></li><li><p>Hollis Robbins on living without voluntary mental imagery &#8212; about 2-4% of people are aphantasic, and recent data (Eker et al. 2024) suggests aphantasic students earn higher undergraduate grades than peers. ~60% still have visual dreams, which means the pathways are intact and what&#8217;s missing is voluntary control. I&#8217;m endlessly interested in this topic, because I&#8217;m mostly aphantasic, though as I&#8217;ve done more visualization work, it feels like it&#8217;s improved a little bit. I do wonder whether verbal or conceptual acuity is sort of a recompense. <a href="https://hollisrobbinsanecdotal.substack.com/p/aphantasia-and-mental-modeling">Aphantasia and Mental Modeling &#8212; Hollis Robbins</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Apocalypse soon, management capacity, ethics and rules ]]></title><description><![CDATA[Links 5/11]]></description><link>https://www.paullitvak.com/p/apocalypse-soon-management-capacity</link><guid isPermaLink="false">https://www.paullitvak.com/p/apocalypse-soon-management-capacity</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Mon, 11 May 2026 15:38:02 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>I enjoyed this Adam Mastroianni article from late 2024 on apocalyptic beliefs - obviously still relevant! He argues that they are so common across cultures because of the nature of comparing the past to the present . Because the negativity of the past fades faster than positivity, the ratio of good to bad memories is higher in the past than the present. Seems like a reasonable mechanism and fits the data, though Mastroianni stops short of actually proving this is the root cause. <a href="https://www.experimental-history.com/p/the-end-is-nigh-and-heres-why">The end is nigh and here&#8217;s why &#8212; Adam Mastroianni</a></p></li><li><p>Another great Dan Davies article where he applies cybernetics and public choice to outsourced state functions and identifies &#8220;management capacity&#8221; &#8212; the ability to process information and respond to feedback &#8212; as the load-bearing missing quantity in modern bureaucracy. Applying this framework to science, we can see that the slowness of scientific communication reduces its responsiveness <a href="https://hypertext.niskanencenter.org/p/taming-the-unaccountability-machine">Taming the unaccountability machine &#8212; Dan Davies</a></p></li><li><p>Max Langenkamp on why ethicists behave no better than other academics, with the operative analogy being that thought experiments are to moral behavior as bike manuals are to bike riding. Wholeheartedly agree. Being good isn&#8217;t about knowing rules. Ties somewhat to Brewer&#8217;s article I linked yesterday - when pressure is on, rules forsake you because executive function shuts down. Knowing the rules won&#8217;t help. <a href="https://maxlangenkamp.substack.com/p/you-do-not-need-ethics-to-be-good">You do not need &#8216;ethics&#8217; to be good &#8212; Max Langenkamp</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Chickens, anxiety, pointillism in science and spirituality]]></title><description><![CDATA[Links 5/10]]></description><link>https://www.paullitvak.com/p/chickens-anxiety-pointillism-in-science</link><guid isPermaLink="false">https://www.paullitvak.com/p/chickens-anxiety-pointillism-in-science</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Sun, 10 May 2026 16:09:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Elias Schmied looks to chickens as the analogy for what AI takeoff might do to humans: chickens around 1700 had a ~100x population gain, supplemental feed, and free-range coops, which was an unprecedented increase in chicken welfare. And then factory farming happened. <a href="https://eliasschmied.substack.com/p/why-i-am-not-as-impressed-by-human">Why I am not as impressed by human progress as I used to be &#8212; Elias Schmied</a></p></li><li><p>Jud Brewer argues lifestyle medicine has the right pillars but the wrong delivery model: willpower-and-education breaks because the prefrontal cortex is the first region to go offline under stress, and a worry loop reinforced thousands of times has stored worry as carrying reward in the orbitofrontal cortex. Instead he advocates for awareness based practices, where you simply start by calling attention to the entire cycle of behavior and fully feeling it. <a href="https://judbrewer.substack.com/p/doing-everything-right-and-still">Doing Everything Right and Still Anxious &#8212; Jud Brewer</a></p></li><li><p>Thomas Insel on small interventions in mental health, and I find myself wondering, do I believe these? 20 minutes of Tetris after a traumatic memory cuts intrusive thoughts by ~70%, hearing aids beat antidepressants for older-adult depression, and Zimbabwe&#8217;s Friendship Bench has grandmothers delivering problem-solving therapy. Science is like a pointillist painting -- it&#8217;s beautiful from afar but up close it looks like a lot of smudges. I was talking to a professor this week who was lamenting being a killjoy. I feel this constant specter of skepticism too. And yet there&#8217;s faith too. The faith I feel is painting-like too; I don&#8217;t take any particular phenomenon as real there, but the overall picture is divine. <a href="https://thomasinsel352222.substack.com/p/a-video-game-a-postcard-and-a-grandmother">A Video Game, a Postcard, and a Grandmother &#8212; Thomas Insel</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Links 5/9]]></title><description><![CDATA[Technology and ritual, quantitative reasoning in ethics, and a brownie recipe]]></description><link>https://www.paullitvak.com/p/links-59</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-59</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Sat, 09 May 2026 17:35:22 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Ted Gioia&#8217;s observations on the importance of ritual and how technology often creates a false facsimile of it. &#8220;When technology truly empowers life and promotes human flourishing, the results are ritualistic.&#8221; <a href="https://www.honest-broker.com/p/13-observations-on-ritual">13 Observations on Ritual &#8212; Ted Gioia</a></p></li><li><p>Richard Chappell on quantitative reasoning in ethics &#8212; against the all-or-nothing framing where numbers either dictate action or get dismissed as soulless. The basic point he&#8217;s making is that the alternative to quantitative reasoning is worse even if doing moral reasoning with numbers is complicated, murky or fraught. I like reading Richard because he offers the strongest form of arguments that I often disagree with. <a href="https://www.goodthoughts.blog/p/good-judgment-with-numbers">Good Judgment with Numbers &#8212; Richard Y Chappell</a></p></li><li><p>David Lebovitz on adapting Honest Chocolat&#8217;s brownie via Phil Rosenthal&#8217;s <em>Somebody Feed Phil</em> cookbook. Lebovitz accounts of eating things in France are very fun to read, and his recipes are great too. I&#8217;ve made his rice pudding with salted caramel sauce and people loved it. <a href="https://davidlebovitz.substack.com/p/phils-brownies-57a">Phil&#8217;s Brownies &#8212; David Lebovitz</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Links 5/8]]></title><description><![CDATA[Goodhart's law and the limits of AI, harmonizing with the Dao, and a kickass recipe]]></description><link>https://www.paullitvak.com/p/links-58</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-58</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Fri, 08 May 2026 16:18:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Tom Reed: a datacenter of automated AI researchers, optimizing against internal evals, would only appear to approach superintelligence while actually optimizing benchmarks that fail to generalize. Reed points out that there&#8217;s no training data available for a machine to learn how to do most messy real-world tasks, e.g. where&#8217;s the training data for being a CEO? Related to my <a href="https://www.paullitvak.com/p/the-heideggerian-critique-of-current">piece</a> on the Heideggerian critique of AI. <a href="https://meagreprotestanthistory.substack.com/p/the-goodhart-singularity">The Goodhart Singularity &#8212; Tom Reed</a></p></li><li><p>Eric Schwitzgebel uses Xunzi and Zhuangzi to argue for harmonizing-with-the-dao as a fourth option in normative ethics, distinct from consequentialism, deontology, and virtue ethics. I&#8217;ve heard spiritual friends express a form of this idea, that you should fully embrace being in the flow of who you are, even if that means being a criminal! Be the best criminal you can be! Like with every ethical system, it breaks down, and probably needs guardrails. <a href="https://eschwitz.substack.com/p/the-ethics-of-harmonizing-with-the">The Ethics of Harmonizing with the Dao &#8212; Eric Schwitzgebel</a></p></li><li><p>Hetty McKinnon&#8217;s vegan spring lasagna: fresh sheets layered with a spinach-tofu cream (tofu, garlic, nutritional yeast, baby spinach, blended), fava beans, pesto, mozzarella; charred top under the broiler. Hetty McKinnon&#8217;s vegetarian cookbook, <a href="https://www.amazon.com/Tenderheart-Cookbook-Vegetables-Unbreakable-Family/dp/0593534867">Tenderheart</a>, is one of my favorite cookbooks of all time. Every recipe in there is so flavorful. Cannot recommend her enough. <a href="https://tovegetableswithlove.substack.com/p/lasagna-verde">Lasagna verde &#8212; Hetty Lui McKinnon</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Links 5/7]]></title><description><![CDATA[AI thinking isn't thinking, being ok with not being ok, and Chinese regional cuisines!]]></description><link>https://www.paullitvak.com/p/links-57</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-57</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Thu, 07 May 2026 15:47:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>This paper argues that AI models&#8217; thinking traces aren&#8217;t &#8220;real thinking&#8221; because swapped, noisy, and arbitrarily-truncated reasoning traces preserve or improve solution accuracy. When I examine my own train of thought when problem solving, I also wonder whether my &#8220;real thinking&#8221; is load bearing. <a href="https://arxiv.org/html/2504.09762v3">Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Traces! &#8212; Subbarao Kambhampati et al.</a></p></li><li><p>Joan Tollifson writes a beautiful piece, steeped in paradox, about what it means to be ok with yourself, even with the parts of yourself that are not ok with you. I&#8217;ve definitely had the experience of accepting that something is unacceptable to me. <a href="https://joantollifson.substack.com/p/okay-with-being-just-as-you-are">Okay with Being Just As You Are? &#8212; Joan Tollifson</a></p></li><li><p>Chinese Cooking Demystified maps 63 distinct Chinese regional cuisines and substyles. There is a companion video if you don&#8217;t want to read all this, but it&#8217;s great. I found it fascinating to learn about how much variety there is in Chinese food. <a href="https://chinesecookingdemystified.substack.com/p/63-chinese-cuisines-the-complete">63 Chinese Cuisines: the Complete Guide &#8212; Chinese Cooking Demystified</a></p></li></ol><p></p><p>(FYI I moved the links to its own section so if these annoy you, you can unsubscribe to just the link posts)</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Links 5/6]]></title><description><![CDATA[The future of social science, group decision-making foibles, and the beauty of sustained attention...]]></description><link>https://www.paullitvak.com/p/links-56</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-56</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Wed, 06 May 2026 18:59:24 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Kevin Munger on AI as an enabling condition for new forms of quantitative social science: a lot of the ideas in here should be familiar - decomposing the scientific PDF into structured, auto-updating artifacts organized around a scientific ontology. Great stuff. <a href="https://kevinmunger.substack.com/p/ai-allows-more-diversity-in-the-forms">AI Allows More Diversity in the Forms of Social Science &#8212; Kevin Munger</a></p></li><li><p>Timothy Burke applies Tetlock&#8217;s <em>Superforecasting</em> findings to institutional decision-making: the most successful forecasting teams adopted rules that <em>required</em> the group to argue even when they happened to agree &#8212; &#8220;no shelter for yes.&#8221; <a href="https://timothyburke.substack.com/p/no-shelter-for-yes">No Shelter for Yes &#8212; Timothy Burke</a></p></li><li><p>Henrik Karlsson writing a beautiful piece last year on the biology of sustained attention. When attention holds, dopamine, hormones, and working memory synchronize and reinforce; jhanas are the inverse of a panic attack; cortisol&#8217;s 60&#8211;90 minute half-life is why frequent task-switching leaves the system decohered. <a href="https://www.henrikkarlsson.xyz/p/attention">Almost Anything You Give Sustained Attention to Will Begin to Loop on Itself and Bloom &#8212; Henrik Karlsson</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Links 5/5]]></title><description><![CDATA[Two of these are about spirituality, not that you're keeping track...]]></description><link>https://www.paullitvak.com/p/links-55</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-55</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Tue, 05 May 2026 16:03:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>A fake skin condition called bixonimania, seeded by Almira Osmanovic Thunstr&#246;m into two preprints in spring 2024, was cited as real by Bing, Gemini, Perplexity, and ChatGPT within weeks. This is a great example of why we need some kind of real grounded layer of scientific evidence. Relying on the foundation models by themselves to know what&#8217;s real doesn&#8217;t work. <a href="https://www.nature.com/articles/d41586-026-01100-y">Scientists invented a fake disease. AI told people it was real &#8212; Chris Stokel-Walker</a></p></li><li><p>L.M. Sacasas&#8217;s piece provides a very compelling metaphor for the way social media affects our minds: digital platforms are doing to inner life what land enclosure did to the commons &#8212; turning the psyche into a resource to be managed and extracted. &#8220;Resist the enclosure of the human psyche.&#8221; Our attention is our most valuable resource. <a href="https://theconvivialsociety.substack.com/p/the-enclosure-of-the-human-psyche">The Enclosure of the Human Psyche &#8212; L.M. Sacasas</a></p></li><li><p>Evan Erickson on a failure mode in opening-awareness meditation: if your basic orientation toward experience is one of pushing-away rather than welcoming, the whole practice quietly becomes a more refined form of avoidance. In other words &#8220;remain uninvolved&#8221; as a meditation instruction can lead to a kind of bypassing. <a href="https://emframes.substack.com/p/unlearning-default-awayness">Unlearning Default Awayness &#8212; Evan Erickson</a></p></li></ol><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Links 5/4]]></title><description><![CDATA[Fascinating experiment which tests different organizational forms for AIs in solving complex problems - solo agent versus delegation to subagents versus markets.]]></description><link>https://www.paullitvak.com/p/links-54</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-54</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Mon, 04 May 2026 17:50:45 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Fascinating experiment which tests different organizational forms for AIs in solving complex problems - solo agent versus delegation to subagents versus markets. Adding an explicit error correction loop for markets seemed like putting the thumb on the scale a bit in favor of markets. Takeaway: AIs are bad at delegating and breaking up problems in subproblems intelligently. <a href="https://www.strangeloopcanon.com/p/why-smart-planners-lose-to-simple">Why Coase needs Hayek &#8212; Rohit Krishnan</a></p></li><li><p>MacIver on C. Thi Nguyen&#8217;s <em>The Score</em>. Been meaning to read Nguyen&#8217;s book - his ideas seem very important in our metric driven era. MacIver&#8217;s long essay is worth reading for the account of the four horsemen of bureaucracy alone (those are: rules, replaceable parts, centralized control, and scale). <a href="https://drmaciver.substack.com/p/how-to-be-less-box-shaped">How to be less box-shaped &#8212; David R. MacIver</a></p></li><li><p>Alison Roman: Spanish tortilla is more a feeling than a recipe (isn&#8217;t a lot of great home cooking?) &#8212; potatoes confit-cooked in olive oil, six or seven eggs, flipped (flipping is the hardest part!) or broiled to set, the reserved oil making the aioli. <a href="https://alisoneroman.substack.com/p/spanish-tortilla-more-a-feeling-than">Spanish Tortilla: more a feeling than a recipe &#8212; Alison Roman</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Links 5/3]]></title><description><![CDATA[These you might want to click on]]></description><link>https://www.paullitvak.com/p/links-53</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-53</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Sun, 03 May 2026 16:12:09 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Sasha Chapin with a great little interview of Zen teacher Henry Shukman on the curriculum after koan training (mindfulness, support, absorption, awakening) and Shukman&#8217;s COVID-era head injury weakening his cognitive capacity and opening his heart: &#8220;when there&#8217;s no resistance to heartbreak, it&#8217;s actually not a problem. It becomes indistinguishable from just an open heart.&#8221; It goes without saying that Chapin&#8217;s writing is a gem, and my contribution is only linking to his older stuff which you maybe missed or forgot about. <a href="https://berkeleyalembic.substack.com/p/a-little-drop-of-sweet-surrender">A Little Drop of Sweet Surrender (a conversation with Henry Shukman) &#8212; Sasha Chapin</a></p></li><li><p>Jared Henderson&#8217;s curated list of free philosophy lecture series on YouTube &#8212; Dreyfus on Heidegger, Kagan on death, Sandel on justice, Brandom on Frege, Sadler&#8217;s paragraph-by-paragraph Hegel. Sometimes I watch these imagining David Chapman chastising me for indulging in philosophy. Guilty pleasure. <a href="https://jaredhenderson.substack.com/p/the-best-philosophy-lectures-on-youtube">The Best Philosophy Lectures on YouTube &#8212; Jared Henderson</a></p></li><li><p>Mi&#8217;sen (another spiritual practitioner with a culinary passion) on Lyon as a &#8220;power place&#8221; in French cooking, the Mothers of Lyon, and a chicken-in-vinegar recipe sitting between Alain Chapel and Simon Hopkinson &#8212; plus the historical claim that French cooks were the ones who saved post-war British food culture in the 60s&#8211;80s. Despite a number of visits to France, I haven&#8217;t been to Lyon yet, which is criminal. <a href="https://hometable.substack.com/p/lyon-tart-and-soul">Lyon: t&#8217;art &amp; soul &#8212; Mi&#8217;sen</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Links 5/2]]></title><description><![CDATA[You don't even have to click on two of these...]]></description><link>https://www.paullitvak.com/p/links-52</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-52</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Sat, 02 May 2026 17:18:46 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Victoria Lynn Carroll on dying as the answer to a long riddle, via a Death Cafe and the Bill Hicks roller-coaster bit. &#8220;Won&#8217;t it be fun to finally know?&#8221; A different kind of memento mori. I&#8217;ve read much of Andrew Holocek&#8217;s book on dying, Tibetan style, and didn&#8217;t see anything in there about fun! <a href="https://byfeel.substack.com/p/what-if-dying-was-fun">What If Dying Was Fun? &#8212; Victoria Lynn Carroll</a></p></li><li><p>@rohit4verse argues against getting on the latest tooling bandwagon for AI and instead focus on learning the fundamental primitives -- evals, context engineering etc. I have approximately 100 of these best practices type tweets saved. Each one is like a little time capsule of AI at a particular moment. I like this one since it goes against the grain of &#8220;use this specific tool&#8221; But I just spared you most of the need to read it. <a href="https://x.com/rohit4verse/status/2049548305408131349">What to Learn, Build, and Skip in AI Agents (2026) &#8212; @rohit4verse</a></p></li><li><p>The best Caesar dressing has dill and caraway in it, courtesy of Jeremy Salamon&#8217;s <em>Second Generation</em> by way of Emily Nunn. I just noticed this is behind a paywall. Saving you a click again - just put dill and caraway in your Caesar. Caraway is a super underrated seed - I typically only see it on rye bread. Another underrated spice? Fenugreek. Emily Nunn is cool though - even just her post titles give me cooking ideas. Worth clicking around to her free stuff. <a href="https://emilyrnunn.substack.com/p/salads-with-roots-from-two-absolute">The Best Caesar Salad You&#8217;ll Ever Eat &#8212; Emily Nunn</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Links 5/1]]></title><description><![CDATA[Andy Hall (another must read) reports on a fascinating experiment in AI governance where AI tries to learn students&#8217; preferences and then represent them in a virtual legislature.]]></description><link>https://www.paullitvak.com/p/links-51</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-51</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Fri, 01 May 2026 19:58:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Andy Hall (another must read) reports on a fascinating experiment in AI governance where AI tries to learn students&#8217; preferences and then represent them in a virtual legislature. How do you get an AI to understand your worldview? <a href="https://freesystems.substack.com/p/training-ai-to-govern-for-us">Training AI to Govern for Us &#8212; Andy Hall</a></p></li><li><p>Henry Farrell persuasively argues that the harms from social media don&#8217;t stem from disinformation, but rather from internalizing incorrect assumptions about the prevalence of certain views. Like for instance you might think that racist beliefs are much more common than they really are based on social media alone. <a href="https://www.programmablemutter.com/p/were-getting-the-social-media-crisis">We&#8217;re getting the social media crisis wrong &#8212; Henry Farrell</a></p></li><li><p>The Countess Caroline von Keyserlingk was Kant&#8217;s closest friend for thirty years, probably his lover, and the likely source of the French turn in his work around 1760. I loved this essay - it totally blew up my assumptions about Kant&#8217;s boring conventional life. <a href="https://neoprimitivism.substack.com/p/the-most-important-woman-in-kants">The Most Important Woman in Kant&#8217;s Life &#8212; Daniel Andreas</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Links 4/30]]></title><description><![CDATA[Chaos edition]]></description><link>https://www.paullitvak.com/p/links-430</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-430</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Thu, 30 Apr 2026 15:58:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>Richard Danzig&#8217;s 2018 CNAS report, written before LLMs and aging scarily: &#8220;superiority is not synonymous with security,&#8221; the human-in-the-loop reassurance is too weak and steadily eroding, and &#8220;early is imperative, late is too late&#8221; for control of complex opaque autonomous systems. Just because we can doesn&#8217;t mean we should. <a href="https://www.cnas.org/publications/reports/technology-roulette">Technology Roulette &#8212; Richard Danzig</a></p></li><li><p>Kevin Kelly bets against the AGI-resolves-by-2029 vibe: AI keeps advancing in ways that <em>expand</em> our ignorance rather than answering it, and US-China duopoly, post-globalization social chaos, and AI-induced media-trust collapse compound for a 10&#8211;15 year stretch of uncertainty about uncertainty itself. This seems plausible to me. It feels like the world is more uncertain than it ever has been in my life. <a href="https://kk.org/thetechnium/our-uncertain-uncertainties/">Our Uncertain Uncertainties &#8212; Kevin Kelly</a></p></li><li><p>Or is that perception of everything becoming more uncertain a kind of illusion? Adam Mastroianni in this article from 2024 (ah, a simpler time): apocalyptic beliefs are so common across cultures because they feel <em>reasonable</em>, not because they feel good &#8212; extending his and Dan Gilbert&#8217;s &#8220;illusion of moral decline&#8221; work to explain why people persistently expect the world to end soon. Re-reading this piece, I am struck that they don&#8217;t really ever show that their proposed mechanisms (biased attention and biased memory) actually cause the illusion. <a href="https://www.experimental-history.com/p/the-end-is-nigh-and-heres-why">The end is nigh and here&#8217;s why &#8212; Adam Mastroianni</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Links 4/29]]></title><description><![CDATA[I just want to acknowledge that it might seem weird for some of you to see metascience and spirituality and AI all juxtaposed.]]></description><link>https://www.paullitvak.com/p/links-429</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-429</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Wed, 29 Apr 2026 18:40:52 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I just want to acknowledge that it might seem weird for some of you to see metascience and spirituality and AI all juxtaposed. You can change your subscription settings to receive only metascience, though links will continue to be in the main section. There is actually a connection between all of these topics, which I promise to elucidate soon. In the meantime:</p><ol><li><p>Natalie Cargill on Olively, an AI app that rewrites your texts to your partner and decodes theirs back to you. Cargill very persuasively makes the case that AI-mediated communication is helping people avoid the real work of attachment repair. I think the reality is more nuanced -- when the AI has the right information it can do more than erode our coping abilities, and help facilitate real therapeutic insight. The full argument will require more than a blurb. <a href="https://nataliercargill.substack.com/p/just-stop-communicating">Just! Stop! Communicating! &#8212; Natalie Cargill</a></p></li><li><p>Erik Hoel diagnoses 21st century cultural stagnation as overfitting &#8212; algorithmic feeds and hyper-discriminatory measurement converge culture on narrow in-distribution outputs, with the AI em-dash tic as a canonical small-scale instance and &#8220;no better marker of culture&#8217;s unoriginality than everyone talking about culture&#8217;s unoriginality.&#8221; Big Erik Hoel fan - his book on the nature of consciousness, The World Behind the World, is a must read. <a href="https://www.theintrinsicperspective.com/p/our-overfitted-century">Our Overfitted Century &#8212; Erik Hoel</a></p></li><li><p>Ben Recht: p-values are a regulatory mechanism, not a measurement device. Recht takes this view maybe a bit further than I would, but viewing statistics as a mechanism for technocratic decision making seems largely correct. This view is one reason I&#8217;m so fixated on better evaluation of randomized control trials, the most load bearing method we have in policy decision-making. <a href="https://www.argmin.net/p/milton-friedmans-p-values">Milton Friedman&#8217;s p-values &#8212; Ben Recht</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[We can create the future of science right now]]></title><description><![CDATA[Most of the parts we need are already being built]]></description><link>https://www.paullitvak.com/p/we-can-create-the-future-of-science</link><guid isPermaLink="false">https://www.paullitvak.com/p/we-can-create-the-future-of-science</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Wed, 29 Apr 2026 16:43:31 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!LgLY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<h3>The bottleneck</h3><p>At this point it is uncontroversial to say that science needs to stop using the PDF article as the unit of knowledge and currency. The unbearable slowness of scientific publishing, the profit motive and margins. I&#8217;m not saying anything new. The PDF also sucks because it&#8217;s hard to extract structured information from, which makes it hard to do evidence synthesis. As a result, we do much less evidence synthesis than is needed. And evidence synthesis ultimately undergirds most policy and medical decision making. I can see second by second real time odds for any sporting or newsworthy event, but a school board can&#8217;t see the best evidence on whether their 8th graders should be taught algebra. As a society we don&#8217;t treat this as an important problem. Again, not controversial. </p><p>Not only is the problem well understood, but the solution has already been laid out. What we need is AI-assisted living evidence synthesis - (1) an open knowledge graph of atomic claims (2) claims linked to evidence (3) assessment and synthesis of each piece of evidence (4) continuous updating with new data. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>What few realize (yet) is that the technical capacity to build this vision for a significant portion of science already exists. Not only that&#8212; scientists and startup teams are already building many of these components. I know this because I&#8217;ve been surveying the space and talking to many of the builders. There are some missing pieces: for example evaluations of how well some of the components work. But at this point most of what&#8217;s missing is a fully end to end working integration of all of these parts. In the rest of this essay, I&#8217;m going to lay out all the parts of a working living evidence layer for science and who is working on them, and propose concrete next steps for building this system.</p><h3>What&#8217;s now possible</h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!LgLY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!LgLY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 424w, https://substackcdn.com/image/fetch/$s_!LgLY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 848w, https://substackcdn.com/image/fetch/$s_!LgLY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 1272w, https://substackcdn.com/image/fetch/$s_!LgLY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!LgLY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png" width="1016" height="1060" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1060,&quot;width&quot;:1016,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:160607,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.paullitvak.com/i/195463251?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!LgLY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 424w, https://substackcdn.com/image/fetch/$s_!LgLY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 848w, https://substackcdn.com/image/fetch/$s_!LgLY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 1272w, https://substackcdn.com/image/fetch/$s_!LgLY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F647ef08d-844b-4545-a518-fbf21f3a4a56_1016x1060.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The diagram above outlines the components of a living evidence synthesis platform, including some of the teams working on each component<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Scientific PDFs are processed into claims with associated evidence. The evidence is subjected to a forensic audit, methodological evaluation and robustness and reproducibility checks. Finally it&#8217;s given a weight in a continuously updating synthesis. What follows is a description and status of each component and a few of the teams working on them.</p><h4>Document understanding</h4><p>The first thing you need to be able to do is turn an article into structured data. Mostly that means parsing PDFs. There are often multicolumn layouts that confuse non-specialized PDF to text processing libraries. For scientific papers, there is the added complexity of parsing formulas and tables and figures. This is a really hot area - there are startups offering APIs, and it seems like a new open source package gets posted to Github every few weeks.  What follows isn&#8217;t exhaustive. A package called <a href="https://github.com/kermitt2/grobid">GROBID</a> was the state of the art for a while, they didn&#8217;t update their package for nearly two years until very recently. In the meantime <a href="https://reducto.ai/">reducto.ai</a> released an AI powered PDF extraction API, <a href="https://github.com/PaddlePaddle/PaddleOCR">PaddleOCR</a> became popular, IBM released a model called <a href="https://github.com/docling-project/docling">Docling</a>, and both <a href="https://mistral.ai/">Mistral</a> and <a href="https://ai.google.dev/gemini-api/docs/document-processing">Gemini</a> created models and libraries. I also know of at least one other well-funded psychology research group working on a paper parser. By contrast,  there are few open evals in this space, with no extensive evals for complex table comprehension in particular. Nonetheless, I'm confident this will be a solved problem soon, given the combination of LLM advances and developer interest.</p><h4>Hypothesis level extraction</h4><p>There has been increasing interest in comprehending the extracted text of papers and linking information to evidence for each hypothesis. A lot of work has already been done.  <a href="https://github.com/ijmarshall/trialstreamer">Trialstreamer</a> (<a href="https://academic.oup.com/jamia/article/27/12/1903/5907063">Marshall et al. 2020</a>) and <a href="https://pypi.org/project/robotreviewer/">RobotReviewer LIVE</a> (Marshall et al. 2023) demonstrated automated extraction of trial population, intervention, and outcome at scale on clinical RCTs. <a href="https://github.com/Future-House/paper-qa">PaperQA2</a> (<a href="https://arxiv.org/abs/2409.13740">Skarlinski et al. 2024</a>) and <a href="https://scholarqa.allen.ai/">Ai2 ScholarQA</a> (2024) extended this to retrieval-augmented question answering with citation grounding. <a href="https://elicit.com/">Elicit</a>, <a href="https://consensus.app/">Consensus</a>, and <a href="https://scispace.com/">SciSpace</a> operationalized claim-level extraction for end users. <a href="https://github.com/OpenEvalProject/evals">OpenEval</a> (<a href="https://www.biorxiv.org/content/10.64898/2026.01.30.702911v1">Booeshaghi et al. 2026</a>) is the most recent and most ambitious: 1.96 million atomic claims extracted from 16,087 eLife manuscripts using Claude Sonnet 4.5, grouped into ~299,000 results, with LLM evaluations showing 81% agreement with human peer review on a 2,487-paper subset. None of these solutions link claims to test statistics, as you would need to evaluate randomized control trials. This is why I built the <a href="https://evidence.guide/">evidence.guide</a> API - to extract hypotheses and associated test statistics from behavioral science papers. The best public eval of this kind of extraction I&#8217;m aware of comes from the recent <a href="https://www.darpa.mil/program/systematizing-confidence-in-open-research-and-evidence">SCORE project</a> - they had humans code thousands of psychology papers to extract their claims by hand. It would be extremely helpful to the world if all scientific PDFs were available as structured open data. I&#8217;ve been working to make this happen, both directly at Berkeley and through coordination with large entities I can&#8217;t yet speak of;  as hard as it is to do, I think it&#8217;s possible<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><h4>Forensic audit</h4><p>A lot of work has been done on forensic audit, but some gaps remain. Of course, for biology papers that rely on images for evidence, there are a variety of tools (notably <a href="https://www.proofig.com/">Proofig</a> and <a href="https://imagetwin.ai/">ImageTwin</a>) to spot anomalies. These are still well short of what sleuths like <a href="https://scienceintegritydigest.com/">Elizabeth Bik</a> can do on her own, but these tools are constant companions among fraud analysts. There&#8217;s someone working on auditing Excel files for anomalies, and a number of teams are automating numerical checks like GRIM and SPRITE, including <a href="https://lhdjung.github.io/scrutiny/">Scrutiny project</a>, the <a href="https://www.medrxiv.org/content/10.1101/2025.09.03.25334905v2">INSPECT-SR</a> team as well as <a href="https://statcheck.io/">statcheck</a>. The <a href="https://arxiv.org/abs/2601.13330">regcheck</a> team is building a way to use AI to compare preregistrations to analyses in papers, to ensure there aren&#8217;t significant deviations. Nonetheless, there are many other kinds of anomalies to screen for, both public and less publicly known. And there are no formal evals for anomaly detection that I&#8217;m aware of. Still, there&#8217;s a lot to draw from in this space and I&#8217;m pretty certain we will be able to scan papers for most kinds of obvious anomalies in the near future.</p><h4>Methodological review</h4><p>This area has been white hot, though I fear for many of the startups in this space, because this capability may become commoditized. There are at least six different AI peer review companies, including <a href="https://refine.ink/">Refine.ink</a>, <a href="https://reviewer3.com/">Reviewer3</a>, <a href="https://www.reviewerzero.ai/">ReviewerZero.ai</a>, <a href="https://www.qedscience.com/">Q.E.D. Science</a>, <a href="https://paper-wizard.com/">Paper Wizard</a>, and <a href="https://isitcredible.com/">Isitcredible</a>. <a href="https://coarse.ink/">Coarse</a> (a pun on refine) was also recently created as an open source alternative. These systems provide qualitative feedback on the content of papers, spotting methodological weaknesses and mathematical errors. They seem to work pretty well, and many academics report bitterly that they exceed the average quality of typical peer reviewers. But there are few evals here either. What evals exist so far involve using LLM-as-judge (circularity problems abound) or comparing against human reviews of questionable quality. What you&#8217;d ideally want is an eval that measures capturing known errors in papers<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. </p><h4>Reproducibility and robustness</h4><p>Another active area has been using AI agents to automate computational reproducibility<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> and robustness<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> checks in papers that report numerical results. For more recent papers where data and code are available, AI agents can see whether they can re-run the analyses and produce the numbers reported in the published paper. In addition to a handful of individual academics who have been experimenting using Claude Code for this, the <a href="https://i4replication.org/">Institute for Replication</a> is a leading group working on building an end to end system. The evals related to this problem are the most mature, with <a href="https://arxiv.org/abs/2409.11363">CORE-Bench</a> (Siegel, Kapoor, Narayanan 2024) and <a href="https://arxiv.org/abs/2504.01848">PaperBench</a> (OpenAI 2025) available to benchmark agents on this task. There is also work on getting AI agents to test alternative ways of analyzing the data to ensure the results are robust to small analytic design choices.</p><h4>Synthesis</h4><p>This is the most underdeveloped area where significant investment is required. Although some automated evidence synthesis systems exist &#8212; for example, <a href="https://ottosr.com/">otto-sr</a> is building an AI agent to write systematic reviews &#8212; none of these incorporate the full range of paper level signals to weight evidence appropriately. Nor is there anything like an eval or a gold standard for a good systematic review. Arguably <a href="https://www.cochranelibrary.com/cdsr/reviews">Cochrane reviews</a> are the closest we have to gold standard human systematic reviews, though I&#8217;ve heard academics in the know complain about their uneven quality. A key question for a synthesis platform is how to weight anomalies and methodological issues in assessing the quality of a piece of evidence. This is an unsolved problem and one I&#8217;m very keen to work on.</p><h4>Continuous updating</h4><p>There are many pieces of basic infrastructure available for monitoring for new research and initiating updates. <a href="https://openalex.org/">OpenAlex</a>  is the current open citation graph. <a href="https://retractionwatch.com/">Retraction Watch</a> integrated into <a href="https://www.crossref.org/">Crossref</a> in October 2023. <a href="https://scite.ai/">Scite</a> tracks how citations support, contrast, or mention prior claims. The <a href="https://community.cochrane.org/review-development/resources/living-systematic-reviews">Living Evidence Network</a> demonstrated continuous-update workflows in clinical guidelines. Engineering this is a relatively straightforward task. </p><p>When you look over this technical architecture and all the progress being made, it&#8217;s hard not to be optimistic that a living guide to scientific evidence will be built.</p><h3>The stakeholders are ready</h3><p>The social infrastructure for this is starting to coalesce &#8212; it&#8217;s not just a pie in the sky academic exercise to imagine this coming into existence. Institutions like the <a href="https://www.cos.io/">Center for Open Science</a>, the Institute for Replication, the INSPECT-SR, the Living Evidence Network and more are all working on scaling work to improve research quality. </p><p>Funders are also aligned. The <a href="https://sloan.org/">Sloan Foundation</a> has funded living evidence work through COS. <a href="https://coefficientgiving.org/">Coefficient Giving</a> supports the Institute for Replication and COS. The <a href="https://astera.org/">Astera Institute</a> and the <a href="https://ifp.org/">Institute for Progress</a> has shown interest in this space. NIH has established an <a href="https://www.nih.gov/replicationandreproducibility">Office for Replication and Reproducibility</a>. Although there are (very unfortunately) serious headwinds in science funding generally, there is an active group of funders interested in metascience.</p><p>A brief word about what I&#8217;ve been doing at RDI. First, as a Visiting Scholar at Berkeley I&#8217;ve been actively figuring out how a non-profit and a public university can conduct and make public the results of large-scale academic article data mining. With some of the money I raised from donors, I commissioned a legal analysis of recent case law and publisher text data mining (TDM) agreements in order to understand whether a massive open data mining of academic articles is possible (with caveats, it is). I&#8217;ve also been working to bring together stakeholders in this space, and identify gaps. I&#8217;ve also been doing some software development in this space, with more to come. </p><h3>A pilot proposal</h3><p>The assumption undergirding all of this is that an AI, given all this information, would make the right judgment about a scientific claim with lots of conflicting evidence, weighing all the factors appropriately. That&#8217;s the hypothesis we need to test. </p><p>Randomized control trial research is the best place to focus on first. RCTs are used to make many of the important decisions in society - from medical trials to public policy changes. And they use a relatively uniform set of inferential statistics with lots of known and available diagnostics. Behavioral science experiments, within the broader realm of RCTs, should be first used as a testbed whose results can be generalized. Because behavioral science is at the vanguard of open science practices, replications abound (there are thousands of them) to serve as ground truth training data. </p><h5>Key Hypothesis </h5><p>Therefore the pilot would test, in behavioral RCTs that have been replicated, whether the quality of evidence for a claim can be used to accurately predict whether that claim will replicate. </p><h5>Secondary Hypotheses</h5><ol><li><p>Compared to claims that replicated,  non-replicated claims demonstrate a greater share of forensic anomalies in their source literatures.</p></li><li><p>Hypothesis level claim and statistic extraction is accurate enough to  scale living evidence without onerous human review costs.</p></li><li><p>Replication prediction is more accurate than prediction markets<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a> or journal prestige.</p></li></ol><p>If all the different quality signals we gather do accurately predict which studies will replicate, then we can use that model to score evidence to power the living evidence layer.</p><h4>Why this is informative regardless of outcome</h4><p>If the pilot succeeds, the architecture extends to medical RCTs (where Living Evidence already operates and integration is mostly about claim representation), then to slices of basic biology with stable replication structure. If it fails, the field learns which quality signals are load-bearing and which ones metascience has oversold. Either result is a contribution to knowing what the literature supports.</p><h3>Implications for funders</h3><p>Because this burgeoning ecosystem of builders already exists, a well-informed philanthropic or government funder could play a crucial catalyzing role in bringing this future about. They could play at least three roles: creating open structured datasets, publishing open benchmarks and incentivizing the solving of key technical challenges.</p><p>First, open archives of papers that the government maintains, like PubMed, could be turned into structured data amenable to large scale metascience and claim aggregation. I know the US government already has an interest in doing this, though some key open questions remain unanswered. How do you determine the best models and systems for accurately extracting information from papers? How can you establish a robust way to allow researchers to flag errors and correct them? And finally, how do you create a legal regime in working with publishers to maximize the scope of available papers? For the latter, a university or a private philanthropy may be better positioned to make structured data publicly available under journal subscription terms or fair use.</p><p>Philanthropists or government funders could also coordinate to create or commission benchmarks that evaluate whether important problems have been solved. For example, an open benchmark for claim extraction from a range of different scientific article types would be extremely helpful. Ensuring the underlying data are accurate is vital for creating these evaluations. I&#8217;ve discussed opportunistically using various human-created datasets for this purpose, but a consistent problem is that errors in human data make it difficult for them to serve as a gold standard.</p><p>Finally, with structured data and benchmarks available, the government or private philanthropy could use them to incentivize groups to develop machine learning systems that meet these benchmarks. Prizes are one potentially valuable tool for this. For example, you could establish a prize for a system that accurately updates a living meta-analysis for a small set of claims. Prizes are particularly useful as signals of problem importance, and can help create vibrant ecosystems of public and private research&#8212;see, for example the role the government played in kickstarting the current work on nuclear fusion.</p><h3>Conclusion</h3><p>The drawbacks of the current scientific publishing system are known. Scientists agree, metascientists agrees, philanthropists agree: the published PDF plus citation graph isn&#8217;t the right substrate for maintaining a representation of the evidence base in science. The pieces needed to build the alternative either already exist or are rapidly taking shape. The community is forming around exactly this problem, with concrete partnerships and shared infrastructure. A pilot should start on behavioral science RCTs because that's the slice of empirical science most amenable to legibility, where replication ground truth is richest, and where the failure modes are best documented. What's been missing is the galvanizing mission to assemble these pieces into something that works. That's what I'm proposing to build. </p><p></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I have a broader field map that I&#8217;ll release publicly soon. This is me, building in public!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>If this is something that you are excited about, please reach out and talk to me.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>More on this very soon too!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>This tests whether, given the code and the data, you can get the same statistics as reported in the published paper.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>This tests whether the results are the same as a paper&#8217;s given alternative analytical decisions in conducting the analysis (like outlier omission). Closely related is the idea of a &#8220;multiverse&#8221; where you come up with many different ways of answering the same underlying research question with the same data, and test whether the results hold in all those alternative methods. There&#8217;s been work on the latter as well.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Some of the replications, e.g. those from the SCORE project, had paired the experiments with forecasts from prediction markets. So we get to look at this for free.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Links 4/28]]></title><description><![CDATA[New paper which scores 6,957 Organization Science submissions and 10,389 reviews with Pangram and find a 42% post-ChatGPT volume surge driven entirely by AI-flagged work, with editorial outcomes deteriorating sharply above a 30% Pangram threshold and the volume surge tracing to publication-count incentives at business schools.]]></description><link>https://www.paullitvak.com/p/links-428</link><guid isPermaLink="false">https://www.paullitvak.com/p/links-428</guid><dc:creator><![CDATA[Paul Litvak]]></dc:creator><pubDate>Tue, 28 Apr 2026 18:09:07 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!hCOn!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F09b955cb-1b0a-48c0-9114-da518a90c6b7_1070x1426.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<ol><li><p>New paper which scores 6,957 <em>Organization Science</em> submissions and 10,389 reviews with Pangram and find a 42% post-ChatGPT volume surge driven entirely by AI-flagged work, with editorial outcomes deteriorating sharply above a 30% Pangram threshold and the volume surge tracing to publication-count incentives at business schools. Fun fact: I know the last author (Lamar Pierce) from grad school and have a fun memory of him standing on one foot in a kitchen at a dinner party exclaiming &#8220;I&#8217;m really good at balancing.&#8221;  <a href="https://pubsonline.informs.org/doi/full/10.1287/orsc.2026.ed.v37.n3">More Versus Better: AI, Incentives, and the Emerging Crisis in Peer Review &#8212; Gartenberg, Hasan, Murray, Pierce</a></p></li><li><p>Dan Davies applies cybernetics and public choice to outsourced state functions and identifies &#8220;management capacity&#8221; &#8212; the ability to process information and respond to feedback &#8212; as the load-bearing missing quantity in modern bureaucracy. I&#8217;m a big fan of Davies&#8217; work. <a href="https://hypertext.niskanencenter.org/p/taming-the-unaccountability-machine">Taming the unaccountability machine &#8212; Dan Davies</a></p></li><li><p>John Psmith&#8217;s review of F.W. Mote&#8217;s <em>Imperial China: 900-1800</em> argues that barbarism and civilization aren&#8217;t a binary but a dial humans turn in response to incentives. Fun read, though I can&#8217;t remember why I saved this article a year ago&#8230; <a href="https://www.thepsmiths.com/p/review-imperial-china-by-fw-mote">REVIEW: Imperial China, by F.W. Mote &#8212; John Psmith</a></p></li></ol><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.paullitvak.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading In One Lifetime! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>