
  <rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
      <title>Duncan Leung</title>
      <link>https://www.duncanleung.com/blog</link>
      <description>📚 Learning, 👨🏻‍💻 Building, 📝 Documenting...</description>
      <language>en-us</language>
      <managingEditor>duncan@leungs.email (Duncan Leung)</managingEditor>
      <webMaster>duncan@leungs.email (Duncan Leung)</webMaster>
      <lastBuildDate>Wed, 17 Jun 2026 00:00:00 GMT</lastBuildDate>
      <atom:link href="https://www.duncanleung.com/tags/rag/feed.xml" rel="self" type="application/rss+xml"/>
      
  <item>
    <guid>https://www.duncanleung.com/blog/rag-reranking-cross-encoder-recall-precision</guid>
    <title>Reranking in RAG: Why Retrieval Needs a Second, Slower Opinion</title>
    <link>https://www.duncanleung.com/blog/rag-reranking-cross-encoder-recall-precision</link>
    <description>Vector search finds the right chunk but ranks it badly and waves through near-garbage that&#39;s vaguely on-topic. Reranking fixes that with a second model - a cross-encoder that reads the query and each chunk together instead of comparing two frozen vectors. The whole trick is affording an expensive judge by only running it on the cheap stage&#39;s survivors.</description>
    <pubDate>Wed, 17 Jun 2026 00:00:00 GMT</pubDate>
    <author>duncan@leungs.email (Duncan Leung)</author>
    <category>ai</category><category>rag</category><category>retrieval</category>
  </item>

    </channel>
  </rss>
