{"id":261439,"date":"2025-05-11T11:43:50","date_gmt":"2025-05-11T02:43:50","guid":{"rendered":"https:\/\/designcopy.net\/en\/how-alibaba-zero-search-trains-llms-without-google\/"},"modified":"2026-04-06T10:07:43","modified_gmt":"2026-04-06T01:07:43","slug":"how-alibaba-zero-search-trains-llms-without-google","status":"publish","type":"post","link":"https:\/\/designcopy.net\/en\/how-alibaba-zero-search-trains-llms-without-google\/","title":{"rendered":"How Alibaba\u2019s ZeroSearch Trains LLMs Without Google"},"content":{"rendered":"<p>In a bold move from <strong>Alibaba&#8217;s Tongyi Lab<\/strong>, <strong>ZeroSearch<\/strong> is flipping the script on how large language models (LLMs) handle searches. This <strong>reinforcement learning framework<\/strong> lets LLMs ditch <strong>real-time engines<\/strong> like Google or Bing. Instead, it uses another LLM to <strong>fake a search engine&#8217;s behavior<\/strong>. Imagine that\u2014one AI pretending to be another, churning out documents that might be spot-on or total junk. It&#8217;s all based on the LLM&#8217;s built-in knowledge from pre-training. Pretty clever, right?<\/p>\n<p>But here&#8217;s the blunt truth: LLMs often spit out <strong>outdated or made-up info<\/strong> because they&#8217;re stuck with old data. ZeroSearch steps in, teaching them to grab and use <strong>external info<\/strong> without breaking the bank.<\/p>\n<p>Now, don&#8217;t get me started on the limitations. LLMs are like that friend who quotes Wikipedia from 2015\u2014reliable until they&#8217;re not. They can&#8217;t fetch fresh facts, leading to fabrications that make you roll your eyes. ZeroSearch fixes this mess by training LLMs to simulate searches, handling <strong>noisy results<\/strong> like a pro. It&#8217;s all about reinforcement learning with a <strong>curriculum strategy<\/strong>\u2014start simple, then crank up the chaos. The <strong>policy model<\/strong> learns to deal with junk data, building resilience.<\/p>\n<p>Oh, and that masking mechanism? It keeps training stable, no drama.<\/p>\n<p>Efficiency? ZeroSearch slashes costs big time. Forget pricey API calls; this setup <strong>cuts expenses by 88 percent<\/strong>. A 3B parameter model pulls off <strong>realistic searches<\/strong> without a cent to external engines. Sarcastic side note: Who knew saving money could make AI training feel like a bargain bin find?<\/p>\n<p>Performance-wise, it&#8217;s no slouch. A 7B model <strong>matches Google<\/strong>, while a 14B one beats it. Additionally, a <a rel=\"nofollow noopener external noreferrer\" target=\"_blank\" href=\"https:\/\/www.marktechpost.com\/2025\/05\/10\/zerosearch-from-alibaba-uses-reinforcement-learning-and-simulated-documents-to-teach-llms-retrieval-without-real-time-search\/\" data-wpel-link=\"external\">3-billion parameter model<\/a> effectively simulates document retrieval with zero API cost. This framework scales across LLMs and algorithms, making it practical for real-world use.<\/p>\n<p>Technically, it kicks off with <strong>lightweight simulations<\/strong>, varying document quality to toughen up the model. Sure, it&#8217;s a bit like playing make-believe, but hey, it works.<\/p>\n<p>In the end, ZeroSearch proves that LLMs can evolve without leaning on giants\u2014innovative, efficient, and, dare I say, a little rebellious. Furthermore, it empowers the model to manage <a rel=\"nofollow noopener external noreferrer\" target=\"_blank\" href=\"https:\/\/the-decoder.com\/zerosearch-alibaba-trains-search-assistant-in-ai-simulation\/\" data-wpel-link=\"external\">multi-step search queries<\/a> by breaking down complex questions into sub-questions.<\/p>\n<p><!-- designcopy-schema-start --><br \/>\n<script type=\"application\/ld+json\">\n{\n  \"@context\": \"https:\/\/schema.org\",\n  \"@type\": \"Article\",\n  \"headline\": \"How Alibaba\u2019s ZeroSearch Trains LLMs Without Google\",\n  \"description\": \"In a bold move from  Alibaba\u2019s Tongyi Lab ,  ZeroSearch  is flipping the script on how large language models (LLMs) handle searches. This  reinforcement learnin\",\n  \"author\": {\n    \"@type\": \"Person\",\n    \"name\": \"DesignCopy\"\n  },\n  \"datePublished\": \"2025-05-11T11:43:50\",\n  \"dateModified\": \"2026-03-07T13:53:28\",\n  \"image\": {\n    \"@type\": \"ImageObject\",\n    \"url\": \"https:\/\/designcopy.net\/wp-content\/uploads\/logo.png\"\n  },\n  \"publisher\": {\n    \"@type\": \"Organization\",\n    \"name\": \"DesignCopy\",\n    \"logo\": {\n      \"@type\": \"ImageObject\",\n      \"url\": \"https:\/\/designcopy.net\/wp-content\/uploads\/logo.png\"\n    }\n  },\n  \"mainEntityOfPage\": {\n    \"@type\": \"WebPage\",\n    \"@id\": \"https:\/\/designcopy.net\/en\/how-alibaba-zero-search-trains-llms-without-google\/\"\n  }\n}\n<\/script><br \/>\n<script type=\"application\/ld+json\">\n{\n  \"@context\": \"https:\/\/schema.org\",\n  \"@type\": \"WebPage\",\n  \"name\": \"How Alibaba\u2019s ZeroSearch Trains LLMs Without Google\",\n  \"url\": \"https:\/\/designcopy.net\/en\/how-alibaba-zero-search-trains-llms-without-google\/\",\n  \"speakable\": {\n    \"@type\": \"SpeakableSpecification\",\n    \"cssSelector\": [\n      \"h1\",\n      \"h2\",\n      \"p\"\n    ]\n  }\n}\n<\/script><br \/>\n<!-- designcopy-schema-end --><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Alibaba&#8217;s AI trains itself without Google, slashing costs by 88% while matching or beating Google&#8217;s performance. Their 14B model proves it.<\/p>\n","protected":false},"author":1,"featured_media":261438,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"_et_pb_use_builder":"","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"categories":[242],"tags":[3242,732],"class_list":["post-261439","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-ai-research-innovations","tag-large-language-models","tag-llms","et-has-post-format-content","et_post_format-et-post-format-standard"],"_links":{"self":[{"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/posts\/261439","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/comments?post=261439"}],"version-history":[{"count":3,"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/posts\/261439\/revisions"}],"predecessor-version":[{"id":264596,"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/posts\/261439\/revisions\/264596"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/media\/261438"}],"wp:attachment":[{"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/media?parent=261439"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/categories?post=261439"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/designcopy.net\/en\/wp-json\/wp\/v2\/tags?post=261439"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}