<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Posts on ryi.me</title>
    <link>https://ryi.me/posts/</link>
    <description>Recent content in Posts on ryi.me</description>
    <generator>Hugo -- gohugo.io</generator>
    <language>en-us</language>
    <lastBuildDate>Mon, 06 Jul 2020 00:00:00 +0000</lastBuildDate><atom:link href="https://ryi.me/posts/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>Uplift model evaluation for randomized control trials</title>
      <link>https://ryi.me/posts/data-science/uplift-model-evaluation/</link>
      <pubDate>Mon, 06 Jul 2020 00:00:00 +0000</pubDate>
      
      <guid>https://ryi.me/posts/data-science/uplift-model-evaluation/</guid>
      <description>An overview of methods to evaluate the effectiveness of models in finding heterogeneous treatment effects in randomized control trials (&amp;ldquo;uplift models&amp;rdquo;), introducing two novel evaluation curves: the adjusted Qini curve and the efficiency curve.
Read the full paper (PDF)
Abstract Uplift models seek to estimate individual treatment effects, helping practitioners answer questions like &amp;ldquo;who should we target with our treatment&amp;rdquo; rather than simply &amp;ldquo;what is the individual treatment effect&amp;rdquo;. This paper provides a comprehensive overview of evaluation methods for such models, with particular focus on:</description>
    </item>
    
    <item>
      <title>An introduction to unbiased [and doubly robust] estimators</title>
      <link>https://ryi.me/posts/data-science/unbiased-estimators/</link>
      <pubDate>Mon, 28 Jan 2019 00:00:00 +0000</pubDate>
      
      <guid>https://ryi.me/posts/data-science/unbiased-estimators/</guid>
      <description>Often, data collection cannot be completely random &amp;ndash; e.g. in clinical trials, where it would be unethical to randomly treat people with medicine, or in online surveys, where response cannot be guaranteed. In such cases, data can be biased, so any inferences drawn or machine learning models built from this data will not generalize well to the overall population. This is where unbiased estimation can come in, in which small adjustments are effectively made to the dataset to make it more representative of a random sample.</description>
    </item>
    
    <item>
      <title>Emergent geometries of groundwater-fed rivers</title>
      <link>https://ryi.me/posts/data-science/emergent-geometries-groundwater-rivers/</link>
      <pubDate>Fri, 01 Dec 2017 00:00:00 +0000</pubDate>
      
      <guid>https://ryi.me/posts/data-science/emergent-geometries-groundwater-rivers/</guid>
      <description>My PhD thesis on emergent geometric complexity in natural systems, using river networks as a rich example of how simple constituent interactions produce novel structures and statistical properties across multiple scales.
Download PDF (65 MB) | MIT DSpace
Abstract This thesis explores emergence—how novel structures arise from collective interactions between constituent entities—through the lens of river network geometry. River networks exemplify emergent complexity: simple physical processes (erosion, diffusion, pressure gradients) interact to produce geometric patterns and power-law statistics that appear fundamentally different from their microscopic origins.</description>
    </item>
    
  </channel>
</rss>
