<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:yandex="http://news.yandex.ru" xmlns:media="http://search.yahoo.com/mrss/" xmlns:turbo="http://turbo.yandex.ru">
<channel>
<title>rethinking models - Servistopauto リップル</title>
<link>https://servistopauto.ru/</link>
<description>rethinking models - Servistopauto リップル</description><item turbo="true">
<turbo:extendedHtml>true</turbo:extendedHtml>
<link>https://servistopauto.ru/rethinking-models/185-Rethinking-driving-world-model-as-synthetic-data-generator-for-perception-tasks-100-sajin.html</link>
<author>trinitydoramy</author>
<category>rethinking models</category>
<pubDate>Thu, 19 Feb 2026 20:32:20 +0300</pubDate>
<turbo:content><![CDATA[<header><h1>Rethinking driving world model as synthetic data generator for perception tasks (100) 사진</h1></header><p><img src="https://ar5iv.labs.arxiv.org/html/2107.04259/assets/figures/fine_tuning.png" alt="2107.04259] Unity Perception: Generate Synthetic Data for Computer Vision"></p> <p><img src="https://www.mdpi.com/applsci/applsci-15-08162/article_deploy/html/images/applsci-15-08162-g001.png" alt="A GAN-Based Framework with Dynamic Adaptive Attention for Multi-Class Image Segmentation in Autonomous Driving"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/02/Screenshot-2025-02-09-at-12.05.22%E2%80%AFPM.png" alt="BARE: A Synthetic Data Generation AI Method that Combines the Diversity of Base Models with the Quality of Instruct-Tuned Models - MarkTechPost"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*vDTB0ngXxtHFQbsLd4SUTw.png" alt="Generative AI Meets Computer Vision | API4AI | Medium"></p> <p><img src="https://dfzljdn9uc3pi.cloudfront.net/2026/cs-3487/1/fig-1-full.png" alt="Flexible multitask deep learning for lane-aware panoptic segmentation in autonomous driving [PeerJ]"></p> <p><img src="https://imgv2-1-f.scribdassets.com/img/document/861673355/original/f8b79fd90d/1?v=1" alt="Review 2 Report........ | PDF | Computer Vision | Artificial Intelligence"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!kZRB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F64313160-0ab9-40c8-91b8-435a787fd3b4_1600x959.png" alt="Important LLM Papers for the Week From 10/11 To 16/11"></p> <p><img src="x-raw-image:///fdd90db64a67121ed5d6fc549934f8741f0cd4431fb431d0a6ef1612d6f94fe6" alt="Generate synthetic datasets and scenarios by learning from the real world"></p> <p><img src="https://cdn.sanity.io/images/h6toihm1/production/681a0d94d1a80f4e0f1750516851bffc7c3c4844-2108x1134.png?auto=format&dpr=2&fit=max&q=75&w=1600" alt="ICCV 2025 Boundary-Pushing CV Research | Voxel51"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*KG6u2oICnZZ-HuJsShCPAg.png" alt="Important LLM Papers for the Week From 27/10 To 01/11 | by Youssef Hosni | Level Up Coding"></p> <p><img src="https://file.techscience.com/uploads/imgs/202511/7a737741ec34789f2113b823c9aa6902.jpg?t=20220620" alt="CMES | Free Full-Text | Deep Learning and Federated Learning in Human Activity Recognition with Sensor Data: A Comprehensive Review"></p> <p><img src="https://simulately.wiki/img/daily/2025-07-22_12-09.png" alt="Simulately"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41598-025-32016-8/MediaObjects/41598_2025_32016_Fig1_HTML.png" alt="An interactive segmentation-based method for seismic facies annotation and segmentation | Scientific Reports"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!zFZG!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0bd7b38e-757f-45b3-a593-326d6b97ec02_1760x1199.png" alt="Important LLM Papers for the Week From 28/07 to 02/08"></p> <p><img src="https://simulately.wiki/img/daily/2025-06-11_11-31.png" alt="Simulately"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41598-024-71187-8/MediaObjects/41598_2024_71187_Fig1_HTML.png" alt="Federated learning system on autonomous vehicles for lane segmentation | Scientific Reports"></p> <p><img src="x-raw-image:///e523b16b86c8c75a3b547e31b3f149db8c87c97381869bba594f3b45aa1f5a5b" alt="Transfer Your Perspective: Controllable 3D Generation from Any Viewpoint in a Driving Scene"></p> <p><img src="https://owen-liuyuxuan.github.io/papers_reading_sharing.github.io/Planning_Control_DL/res/uniad_pipeline.png" alt="End-to-end Autonomous Driving: Challenges and Frontiers - Reading Collections"></p> <p><img src="https://www.cell.com/cms/10.1016/j.xinn.2025.100948/asset/627c42f0-c235-4c27-80c9-e0047053c869/main.assets/fx1_lrg.jpg" alt="Foundation models and intelligent decision-making: Progress, challenges, and perspectives: The Innovation"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/01/Screenshot-2025-01-23-at-6.37.15%E2%80%AFPM.png" alt="Google AI Introduces Learn-by-Interact: A Data-Centric Framework for Adaptive and Efficient LLM Agent Development - MarkTechPost"></p> <p><img src="https://lookaside.instagram.com/seo/google_widget/crawler/?media_id=3800952540222280364" alt="Ever wondered how the most complex AI systems - like deep neural networks - start with something as simple as a straight line? The Perceptron, the "atom of deep learning," is that"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/07/Screenshot-2025-07-11-at-1.50.47%E2%80%AFPM.png" alt="From Perception to Action: The Role of World Models in Embodied AI Systems - MarkTechPost"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs00170-025-15834-y/MediaObjects/170_2025_15834_Fig12_HTML.png" alt="Toward sustainable production: a synthetic dataset framework to accelerate quality control via generative and predictive AI | The International Journal of Advanced Manufacturing Technology"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!xbQ5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bbd7180-10dc-4080-bbb3-3b47d46a53b3_1600x1009.png" alt="Important LLM Papers for the Week From 10/11 To 16/11"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/e18b33c3a7c7bc6657032ef93d088582/preprints-192374-g006.png" alt="3D Environment Generation from Sparse Inputs for Automated Driving Function Development[v1] | Preprints.org"></p> <p><img src="https://miro.medium.com/1*7_Me18nku0DSvheKjBvCqw.png" alt="The Art of Synthetic Wisdom: Teaching Machines to Think in the Absence of Data | by R. Thompson (PhD) | Towards AI"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/cosmos-drive-dreams-scalable-synthetic-driving-data-generation-with-world-foundation-models-1.png" alt="Literature Review] Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41467-023-37677-5/MediaObjects/41467_2023_37677_Fig1_HTML.png" alt="Learning naturalistic driving environment with statistical realism | Nature Communications"></p> <p><img src="https://i1.rgstatic.net/publication/397006828_SynAD_Enhancing_Real-World_End-to-End_Autonomous_Driving_Models_through_Synthetic_Data_Integration/links/69018e454baee165918e7993/largepreview.png" alt="PDF) SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration"></p> <p><img src="https://www.frontiersin.org/files/Articles/1451923/fnbot-19-1451923-HTML/image_m/fnbot-19-1451923-g004.jpg" alt="Frontiers | A survey of decision-making and planning methods for self- driving vehicles"></p> <p><img src="https://pica.zhimg.com/v2-bea81885bab59f535047bb2bdada7998_1440w.jpg" alt="分享下我们团队最新工作Dream4Drive【小米汽车x北大】 - 知乎"></p> <p><img src="x-raw-image:///dfbf2882333ede688e1e9bbd8a013a8fa454c403192a4c89a40b1e79455b8f0c" alt="A Comprehensive Literature Review on Modular Approaches to Autonomous Driving: Deep Learning for Road and Racing Scenarios"></p> <p><img src="https://simulately.wiki/img/daily/2025-07-22_12-03.png" alt="Simulately"></p> <p><img src="https://lookaside.instagram.com/seo/google_widget/crawler/?media_id=3797284324183175858" alt="Ever wondered how the most complex AI systems - like deep neural networks - start with something as simple as a straight line? The Perceptron, the "atom of deep learning," is that"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/cosmos-drive-dreams-scalable-synthetic-driving-data-generation-with-world-foundation-models-4.png" alt="Literature Review] Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models"></p> <p><img src="https://i1.rgstatic.net/publication/372404110_Drive_Like_a_Human_Rethinking_Autonomous_Driving_with_Large_Language_Models/links/65ee5c9db7819b433bf52adc/largepreview.png" alt="PDF) Drive Like a Human: Rethinking Autonomous Driving with Large Language Models"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs00170-025-15834-y/MediaObjects/170_2025_15834_Fig4_HTML.jpg" alt="Toward sustainable production: a synthetic dataset framework to accelerate quality control via generative and predictive AI | The International Journal of Advanced Manufacturing Technology"></p> <p><img src="https://www.degruyterbrill.com/document/doi/10.1515/auto-2023-0026/asset/graphic/j_auto-2023-0026_fig_004.jpg" alt="Synthetic data generation for the continuous development ..."></p> <p><img src="https://substackcdn.com/image/fetch/$s_!ZMZW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F770ed748-92c8-4f4d-a046-5d032cbf489d_1600x806.png" alt="Important LLM Papers for the Week From 10/11 To 16/11"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1057%2Fs41599-024-03611-3/MediaObjects/41599_2024_3611_Fig1_HTML.png" alt="Large language models empowered agent-based modeling and simulation: a survey and perspectives | Humanities and Social Sciences Communications"></p> <p><img src="https://cdn-uploads.huggingface.co/production/uploads/64e9c855233101ed99ca2315/TF-bMlMRnJDe-5i4Dlm8W.jpeg" alt="Paper page - GenieDrive: Towards Physics-Aware Driving World Model with 4D Occupancy Guided Video Generation"></p> <p><img src="https://simulately.wiki/img/daily/2025-06-18_11-00.png" alt="Simulately"></p> <p><img src="https://lookaside.instagram.com/seo/google_widget/crawler/?media_id=3793952859566810720" alt="A recent T-RO paper introduces a model-simplification framework for efficient deformable-object manipulation: task-conditioned action-space reduction plus simplified #dynamics lets faster planning for cloth folding & rope shaping. Paper title: Goal ..."></p> <p><img src="https://cdn-uploads.huggingface.co/production/uploads/64e57309b78bc92221ce3b70/dwqGmx-hiCdAikyUuNLkO.png" alt="Daily Papers - Hugging Face"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs11831-025-10355-z/MediaObjects/11831_2025_10355_Fig9_HTML.png" alt="A Critical Analysis of Generative AI: Challenges, Opportunities, and Future Research Directions | Archives of Computational Methods in Engineering | Springer Nature Link"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/cosmos-drive-dreams-scalable-synthetic-driving-data-generation-with-world-foundation-models-0.png" alt="Literature Review] Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models"></p> <p><img src="https://arxiv.org/html/2601.10061v1/x2.png" alt="Papers by Bohan Zeng"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs42154-024-00347-3/MediaObjects/42154_2024_347_Fig3_HTML.png" alt="Certified Robustness in Automated Driving Perception: A Review | Automotive Innovation"></p> <p><img src="https://data.the-innovation.org//innovation-data/informatics/2025/1/PIC/TII-2025-0014-Abstract.jpg" alt="AeroVerse-Review: Comprehensive survey on aerial embodied vision-and-language navigation"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/f40cf2027ebc78f06dace3dadd888556/preprints-148797-g006.png" alt="Synchronization, Optimization, and Adaptation of Machine Learning Techniques for Computer Vision in Cyber-Physical Systems: A Comprehensive Analysis[v2] | Preprints.org"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs00170-025-15834-y/MediaObjects/170_2025_15834_Fig1_HTML.jpg" alt="Toward sustainable production: a synthetic dataset framework to accelerate quality control via generative and predictive AI | The International Journal of Advanced Manufacturing Technology"></p> <p><img src="https://imgv2-1-f.scribdassets.com/img/document/758587985/original/08fe4ef5cd/1?v=1" alt="Multimodal Language Models in Autonomous Driving | PDF | Artificial Intelligence | Intelligence (AI) & Semantics"></p> <p><img src="https://cdn.sanity.io/images/h6toihm1/production/271ea9ce82ffb14f837b39bfeadb3a4997112fba-2516x1260.png?auto=format&dpr=2&fit=max&q=75&w=1600" alt="ICCV 2025 Boundary-Pushing CV Research | Voxel51"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/e18b33c3a7c7bc6657032ef93d088582/preprints-192374-g007.png" alt="3D Environment Generation from Sparse Inputs for Automated Driving Function Development[v1] | Preprints.org"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/cosmos-drive-dreams-scalable-synthetic-driving-data-generation-with-world-foundation-models-3.png" alt="Literature Review] Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2024/03/Screenshot-2024-03-04-at-2.29.17-PM.png" alt="Automated Prompt Engineering: Leveraging Synthetic Data and Meta-Prompts for Enhanced LLM Performance - MarkTechPost"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs42154-024-00347-3/MediaObjects/42154_2024_347_Fig6_HTML.png" alt="Certified Robustness in Automated Driving Perception: A Review | Automotive Innovation"></p> <p><img src="https://www.cell.com/cms/10.1016/j.xinn.2025.100948/asset/1d7ebdda-302a-4ef0-a2d7-2d90268450ec/main.assets/gr5_lrg.jpg" alt="Foundation models and intelligent decision-making: Progress, challenges, and perspectives: The Innovation"></p> <p><img src="x-raw-image:///447448f483f6907c95a1aba761845f60621eeaea498378d22f9e17b854a1d639" alt="Vision Language Models in Autonomous Driving: A Survey and Outlook"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/e18b33c3a7c7bc6657032ef93d088582/preprints-192374-g002.png" alt="3D Environment Generation from Sparse Inputs for Automated Driving Function Development[v1] | Preprints.org"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*a0YQ2cdVG1cKZ0x87reROQ.png" alt="Important LLM Papers for the Week From 27/10 To 01/11 | by Youssef Hosni | Level Up Coding"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/f40cf2027ebc78f06dace3dadd888556/preprints-148797-g009.png" alt="Synchronization, Optimization, and Adaptation of Machine Learning Techniques for Computer Vision in Cyber-Physical Systems: A Comprehensive Analysis[v2] | Preprints.org"></p> <p><img src="x-raw-image:///30f4488f537038ec000f8289a45c0ff68160942b6b6d69c4c8cd621217f69aa2" alt="Synthetic Data Generation Using Large Language Models: Advances in Text and Code"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2026/01/NVIDIA-MOBILE-3-scaled.png" alt="BARE: A Synthetic Data Generation AI Method that Combines the Diversity of Base Models with the Quality of Instruct-Tuned Models - MarkTechPost"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1057%2Fs41599-024-03611-3/MediaObjects/41599_2024_3611_Fig4_HTML.png" alt="Large language models empowered agent-based modeling and simulation: a survey and perspectives | Humanities and Social Sciences Communications"></p> <p><img src="https://simulately.wiki/img/daily/2025-06-26_17-33_1.png" alt="Simulately"></p> <p><img src="https://lookaside.instagram.com/seo/google_widget/crawler/?media_id=3797284324065681153" alt="Ever wondered how the most complex AI systems - like deep neural networks - start with something as simple as a straight line? The Perceptron, the "atom of deep learning," is that"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2023/11/Screenshot-2023-11-29-at-6.59.03-AM.png" alt="This AI Research Introduces FollowNet: A Comprehensive Benchmark Dataset for Car-Following Behavior Modeling - MarkTechPost"></p> <p><img src="https://i1.rgstatic.net/publication/398601988_WorldLens_Full-Spectrum_Evaluations_of_Driving_World_Models_in_Real_World/links/693b96207e61d05b530e05d5/largepreview.png" alt="PDF) WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World"></p> <p><img src="https://ar5iv.labs.arxiv.org/html/2312.00438/assets/x17.png" alt="2312.00438] Dolphins: Multimodal Language Model for Driving"></p> <p><img src="https://www.mdpi.com/aerospace/aerospace-10-00604/article_deploy/html/images/aerospace-10-00604-g001.png" alt="Synthetic Training Data for Semantic Segmentation of the Environment from UAV Perspective"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!qEHI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb1cb034c-1227-4751-bb1c-6e42b142f6bd_1600x771.png" alt="Important LLM Papers for the Week From 27/10 To 01/11"></p> <p><img src="https://lookaside.fbsbx.com/lookaside/crawler/media/?media_id=3860399364231788" alt="Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More? (Google DeepMind, July 2024) Paper: [https://arxiv.org/abs/2406.13121](https://arxiv.org/abs/2406.13121) Abstract: "Long-context language models (LCLMs) have the potential to ..."></p> <p><img src="https://www.mdpi.com/smartcities/smartcities-08-00079/article_deploy/html/images/smartcities-08-00079-g001.png" alt="A Comprehensive Literature Review on Modular Approaches to Autonomous Driving: Deep Learning for Road and Racing Scenarios"></p> <p><img src="x-raw-image:///bd7b24c6f225b98a7330b7deb0e000b40a09f8f98c969b77c4982f0df530048d" alt="Generative AI for Autonomous Driving: Frontiers and Opportunities"></p> <p><img src="x-raw-image:///ce5639040c00dcf0f1bce18f047ad2c500bc26c0da5bffd6c033a2ef27f023ca" alt="Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving"></p> <p><img src="https://grafft.github.io/assets/img/publication_preview/segdt2025.png" alt="publications | Aleksandr I. Panov"></p> <p><img src="https://lookaside.fbsbx.com/lookaside/crawler/media/?media_id=4071512173120505" alt="Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling (Shanghai AI Laboratory, February 2025) Paper: [https://arxiv.org/abs/2502.06703](https://arxiv.org/abs/2502.06703) Abstract: "Test-Time Scaling (TTS) is an important method for ..."></p> <p><img src="https://lookaside.instagram.com/seo/google_widget/crawler/?media_id=3804828883984471860" alt="🧠 Top 5 Trends in Data Annotation Heading Into 2026 In 2026, the focus is shifting from “more labels” to smarter, scalable, and production-ready data pipelines. Here are the five key data"></p> <p><img src="https://ietresearch.onlinelibrary.wiley.com/cms/asset/7b350e90-bbe8-45ea-824d-6e23345cfea5/ipr212710-fig-0010-m.jpg" alt="Multi‐domain autonomous driving dataset: Towards enhancing the generalization of the convolutional neural networks in new environments - Khosravian - 2023 - IET Image Processing - Wiley Online Library"></p> <p><img src="https://lookaside.instagram.com/seo/google_widget/crawler/?media_id=3802514425094976747" alt="🧠 Top 5 Trends in Data Annotation Heading Into 2026 In 2026, the focus is shifting from “more labels” to smarter, scalable, and production-ready data pipelines. Here are the five key data"></p> <p><img src="https://www.cell.com/cms/10.1016/j.patter.2025.101260/asset/38a90fb3-8309-4eff-82eb-366b1379ece8/main.assets/gr10_lrg.jpg" alt="Unleashing the potential of prompt engineering for large language models: Patterns"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/cosmos-drive-dreams-scalable-synthetic-driving-data-generation-with-world-foundation-models-2.png" alt="Literature Review] Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models"></p> <p><img src="https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20241111153229223-0840:S0263574724001115:S0263574724001115_fig5.png?pub-status=live" alt="Robust motion planning for mobile robots under attacks against obstacle localization | Robotica | Cambridge Core"></p> <p><img src="x-raw-image:///01c309847cdbb51321c4feb627515f84b9bbde64e0749a855aaad5f1265f90b7" alt="Generate synthetic datasets and scenarios by learning from the real world"></p> <p><img src="https://www.mdpi.com/smartcities/smartcities-08-00079/article_deploy/html/images/smartcities-08-00079-g007.png" alt="A Comprehensive Literature Review on Modular Approaches to Autonomous Driving: Deep Learning for Road and Racing Scenarios"></p> <p><img src="https://www.researchgate.net/publication/344372443/figure/fig1/AS:962203742117894@1606418657887/Vision-tasks-in-clear-and-rain-augmented-images-Our-synthetic-rain-rendering-framework.png" alt="Vision tasks in clear and rain-augmented images. Our synthetic rain... | Download Scientific Diagram"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/01/Screenshot-2025-01-03-at-10.20.09%E2%80%AFPM.png" alt="OS-Genesis: A Novel GUI Data Synthesis Pipeline that Reverses the Conventional Trajectory Collection Process - MarkTechPost"></p> <p><img src="https://dfzljdn9uc3pi.cloudfront.net/2024/cs-2250/1/fig-6-full.png" alt="Road surface semantic segmentation for autonomous driving [PeerJ]"></p> <p><img src="https://raw.githubusercontent.com/zhengchen1999/RGT/main/figs/RGT.png" alt="4x RGT_S - OpenModelDB"></p> <p><img src="https://www.mdpi.com/smartcities/smartcities-08-00079/article_deploy/html/images/smartcities-08-00079-g004.png" alt="A Comprehensive Literature Review on Modular Approaches to Autonomous Driving: Deep Learning for Road and Racing Scenarios"></p> <p><img src="https://i1.rgstatic.net/publication/395402499_Foundation_Models_for_Autonomous_Driving_Perception_A_Survey_Through_Core_Capabilities/links/68c241ca73c8345b7a5b91af/largepreview.png" alt="PDF) Foundation Models for Autonomous Driving Perception: A Survey Through Core Capabilities"></p> <p><img src="https://ietresearch.onlinelibrary.wiley.com/cms/asset/92912fd6-1a43-41ee-b5dc-a456fc8a678d/ipr212710-fig-0009-m.jpg" alt="Multi‐domain autonomous driving dataset: Towards enhancing the generalization of the convolutional neural networks in new environments - Khosravian - 2023 - IET Image Processing - Wiley Online Library"></p> <p><img src="https://dfzljdn9uc3pi.cloudfront.net/2024/cs-2250/1/fig-2-full.png" alt="Road surface semantic segmentation for autonomous driving [PeerJ]"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1186%2Fs40537-025-01247-x/MediaObjects/40537_2025_1247_Fig1_HTML.png" alt="Generative AI in depth: A survey of recent advances, model variants, and real-world applications | Journal of Big Data | Springer Nature Link"></p> <p><img src="https://pub.mdpi-res.com/sensors/sensors-25-03586/article_deploy/html/images/sensors-25-03586-ag.png?1749539961" alt="Closing the Domain Gap: Can Pseudo-Labels from Synthetic UAV Data Enable Real-World Flood Segmentation?"></p> <p><img src="https://picx.zhimg.com/v2-9743fccc9854e9c09e079b51de6a027d_1440w.jpg" alt="CV计算机视觉每日开源代码Paper with code速览-2025.11.15 - 知乎"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!mc-o!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faaaeb570-af00-4904-8b96-2b78c269c013_1600x766.png" alt="Important LLM Papers for the Week From 27/10 To 01/11"></p> <p><img src="http://www.jacc.org/cms/asset/ee98c18b-54a1-4d59-8637-ee2ebef8d6e7/ga1.jpg" alt="Artificial Intelligence for Cardiovascular Care in Action: From Learning to Implementation in Health Systems | JACC: Advances"></p>]]></turbo:content>
</item><item turbo="true">
<turbo:extendedHtml>true</turbo:extendedHtml>
<link>https://servistopauto.ru/rethinking-models/186-Rethinking-entropy-regularization-in-large-reasoning-models-89-sajin.html</link>
<author>trinitydoramy</author>
<category>rethinking models</category>
<pubDate>Thu, 19 Feb 2026 20:32:20 +0300</pubDate>
<turbo:content><![CDATA[<header><h1>Rethinking entropy regularization in large reasoning models (89) 사진</h1></header><p><img src="https://wires.onlinelibrary.wiley.com/cms/asset/6e7e14bd-69cc-4324-9927-dc4961981a31/wics70049-toc-0001-m.jpg" alt="Theoretical Perspectives on Knowledge Distillation: A Review - Liu - 2025 - WIREs Computational Statistics - Wiley Online Library"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/04/Screenshot-2025-04-17-at-8.55.39%E2%80%AFPM.png" alt="Do Reasoning Models Really Need Transformers?: Researchers from TogetherAI, Cornell, Geneva, and Princeton Introduce M1—A Hybrid Mamba-Based AI that Matches SOTA Performance at 3x Inference Speed - MarkTechPost"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/03/Screenshot-2025-03-22-at-5.29.25%E2%80%AFPM.png" alt="Fin-R1: A Specialized Large Language Model for Financial Reasoning and Decision-Making - MarkTechPost"></p> <p><img src="https://www.cell.com/cms/10.1016/j.patter.2025.101370/asset/79ba6fd6-d2fc-4a84-89da-638486dc8b0d/main.assets/gr1_lrg.jpg" alt="Toward large reasoning models: A survey of reinforced reasoning with large language models: Patterns"></p> <p><img src="x-raw-image:///81c32f087933c5b4b5ab2c14735e9727b46e7e8fcb9719a1f77cb8af1209c2ea" alt="Tractable Regularization of Probabilistic Circuits"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/d540f3adf94740ecad89ff070d49d546/preprints-167478-g006.png" alt="Conditional Domain Adaptation with α-Rényi Entropy Regularization and Noise-Aware Label Weighting[v1] | Preprints.org"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/d540f3adf94740ecad89ff070d49d546/preprints-167478-g001.png" alt="Conditional Domain Adaptation with α-Rényi Entropy Regularization and Noise-Aware Label Weighting[v1] | Preprints.org"></p> <p><img src="https://lookaside.fbsbx.com/lookaside/crawler/media/?media_id=3880783375526720" alt="Internal Consistency and Self-Feedback in Large Language Models: A Survey (Renmin University of China, July 2024) Paper: [https://arxiv.org/abs/2407.14507](https://arxiv.org/abs/2407.14507) Abstract: "Large language models (LLMs) are expected to ..."></p> <p><img src="https://substackcdn.com/image/fetch/$s_!VE6s!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4192ad2-161d-4040-a78f-c951ee003dad_1600x900.png" alt="Important LLM Papers for the Week From 27/10 To 01/11"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/rethinking-llm-human-simulation-when-a-graph-is-what-you-need-1.png" alt="Literature Review] Rethinking LLM Human Simulation: When a Graph is What You Need"></p> <p><img src="https://miro.medium.com/1*zGIEOOKo2JstKNdvowPaGA.png" alt="EPO: Entropy-Regularized Policy Optimization for LLM Agents Reinforcement Learning | by Dixon | Medium"></p> <p><img src="https://www.cell.com/cms/10.1016/j.patter.2025.101370/asset/526d7c72-c3d6-4e0c-9a2f-864d34ac3919/main.assets/gr4_lrg.jpg" alt="Toward large reasoning models: A survey of reinforced reasoning with large language models: Patterns"></p> <p><img src="https://www.cell.com/cms/10.1016/j.patter.2025.101370/asset/eeea5ce1-ba2b-4ab1-acdf-c5c98dbc9110/main.assets/gr7_lrg.jpg" alt="Toward large reasoning models: A survey of reinforced reasoning with large language models: Patterns"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/05/Screenshot-2025-05-22-at-11.36.50%E2%80%AFAM.png" alt="Beyond Aha Moments: Structuring Reasoning in Large Language Models - MarkTechPost"></p> <p><img src="https://i1.rgstatic.net/publication/392629816_Spurious_Rewards_Rethinking_Training_Signals_in_RLVR/links/684b99b643aad60b4c16b771/largepreview.png" alt="PDF) Spurious Rewards: Rethinking Training Signals in RLVR"></p> <p><img src="https://simulately.wiki/img/daily/2025-07-22_12-09.png" alt="Simulately"></p> <p><img src="https://pic2.zhimg.com/v2-031997a8a0f4b09cbc5b1c8283f4e3eb_1440w.jpg" alt="上海AI lab 提出SIREN：重新思考RLVR 中的熵正则- 知乎"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/review-of-inference-time-scaling-strategies-reasoning-search-and-rag-1.png" alt="Literature Review] Review of Inference-Time Scaling Strategies: Reasoning, Search and RAG"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/rethinking-entropy-regularization-in-large-reasoning-models-2.png" alt="Literature Review] Rethinking Entropy Regularization in Large Reasoning Models"></p> <p><img src="https://www.mdpi.com/entropy/entropy-27-00722/article_deploy/html/images/entropy-27-00722-g003.png" alt="Entropy-Regularized Attention for Explainable Histological Classification with Convolutional and Hybrid Models"></p> <p><img src="https://emergentmind-storage-cdn-c7atfsgud9cecchk.z01.azurefd.net/ai-images/67b739be38cc2be8dbe1091107751a63.webp" alt="Reinforcement Learning with Verifiable Rewards"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/eb6fbb4728ee2b580db943cc1a1e4f1b/preprints-185070-g002.png" alt="Contextual Knowledge Infusion via Iterative Semantic Tracing for Vision–Language Understanding[v1] | Preprints.org"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!PcKh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8d635353-5ff8-4edb-a2cd-67c3f8527b95_1472x1036.png" alt="The State of Reinforcement Learning for LLM Reasoning"></p> <p><img src="https://rlhfbook.com/c/images/rlhf-overview.png" alt="Reward Modeling | RLHF Book by Nathan Lambert"></p> <p><img src="https://www.cell.com/cms/10.1016/j.patter.2025.101370/asset/81b529bb-7757-494f-bd65-811661efc971/main.assets/gr3_lrg.jpg" alt="Toward large reasoning models: A survey of reinforced reasoning with large language models: Patterns"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/clip-low-increases-entropy-and-clip-high-decreases-entropy-in-reinforcement-learning-of-large-language-models-4.png" alt="Literature Review] Clip-Low Increases Entropy and Clip-High Decreases Entropy in Reinforcement Learning of Large Language Models"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41598-025-29849-8/MediaObjects/41598_2025_29849_Fig1_HTML.png" alt="Multi-access edge computing scheduling optimization model for remote education under 6G network environment based on reinforcement learning | Scientific Reports"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/06/Screenshot-2025-06-18-at-11.51.15%E2%80%AFPM.png" alt="ReVisual-R1: An Open-Source 7B Multimodal Large Language Model (MLLMs) that Achieves Long, Accurate and Thoughtful Reasoning - MarkTechPost"></p> <p><img src="x-raw-image:///c54023b948bca4ec408c8884a3c6bbbbd2b37b1f34f9f0cb958967de7787ca78" alt="Depth-Breadth Synergy in RLVR: Unlocking LLM Reasoning Gains with Adaptive Exploration"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*s8nNRJ-mRHp3RrcWHo_zRQ.png" alt="Important LLM Papers for the Week From 28/07 to 02/08 | by Youssef Hosni | Towards AI"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!abwB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4f17a2fc-0cbf-446d-bd27-fc8c6ac4117a_1600x1238.png" alt="The State of Reinforcement Learning for LLM Reasoning"></p> <p><img src="https://guoqiangwei.xyz/iclr2023_stats/images/stats_bar_1117.png" alt="ICLR2023 Statistics"></p> <p><img src="x-raw-image:///8925c53478a2def9aa2ed6c9636b0d8217fcd54eabd621b562cb4d4d7ec6f0d3" alt="Rethinking Data Use in Large Language Models"></p> <p><img src="x-raw-image:///697273404b81a3cc8489deb9a4b9efd250598ff8ba2e5d1c2bf1829093243424" alt="Rethinking Chain-of-Thought from the Perspective of Self-Training"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!FL9p!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F96e79a6d-e9c2-49a9-8242-de47ae444408_1600x830.png" alt="The State of Reinforcement Learning for LLM Reasoning"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!cy0i!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F348906f7-f3ce-4b68-9513-b364de1efdbd_1760x887.png" alt="Important LLM Papers for the Week From 28/07 to 02/08"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs11263-024-02321-2/MediaObjects/11263_2024_2321_Fig5_HTML.png" alt="Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective | International Journal of Computer Vision"></p> <p><img src="https://www.preprints.org/frontend/picture/ms_xml/manuscript/d540f3adf94740ecad89ff070d49d546/preprints-167478-g007.png" alt="Conditional Domain Adaptation with α-Rényi Entropy Regularization and Noise-Aware Label Weighting[v1] | Preprints.org"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*KG6u2oICnZZ-HuJsShCPAg.png" alt="Important LLM Papers for the Week From 27/10 To 01/11 | by Youssef Hosni | Level Up Coding"></p> <p><img src="https://pbs.twimg.com/media/G42EjnpbQAQCkZd.jpg" alt="Yafu Li (@yafuly) / Posts / X"></p> <p><img src="x-raw-image:///88b237e21c63427ce195d3732d1e932f4452fac883a5bfaa2a6ceaa248ae9a22" alt="Can Large Reasoning Models Self-Train?"></p> <p><img src="https://towardsdatascience.com/wp-content/uploads/2025/04/20250430_1812_AI-Brain-Solving-Equation_simple_compose_01jt3m0qfbf1paveshd04tg6sh.png" alt="Reinforcement Learning from One Example? | Towards Data Science"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!2SLQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F739d9d44-4d24-4d82-bbe8-a68ae5bedd2c_1600x1116.png" alt="The State of Reinforcement Learning for LLM Reasoning"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/06/Screenshot-2025-06-01-at-9.47.21%E2%80%AFPM.png" alt="Off-Policy Reinforcement Learning RL with KL Divergence Yields Superior Reasoning in Large Language Models - MarkTechPost"></p> <p><img src="https://www.mlpod.com/wp-content/uploads/2025/10/201cd5ae-5b58-4a02-ad9e-83ffcce0745c.png" alt="上海AI lab 提出SIREN：重新思考RLVR 中的熵正则- 机器学习POD"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/from-entropy-to-epiplexity-rethinking-information-for-computationally-bounded-intelligence-1.png" alt="论文评述] From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs11263-024-02321-2/MediaObjects/11263_2024_2321_Fig6_HTML.png" alt="Rethinking Generalizability and Discriminability of Self-Supervised Learning from Evolutionary Game Theory Perspective | International Journal of Computer Vision"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41586-025-09631-6/MediaObjects/41586_2025_9631_Fig1_HTML.png" alt="Aligning machine and human visual representations across abstraction levels | Nature"></p> <p><img src="https://picx.zhimg.com/v2-22569b94e396a26ad4bf9150b690db83_1440w.jpg" alt="上海AI lab 提出SIREN：重新思考RLVR 中的熵正则- 知乎"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs42256-024-00961-0/MediaObjects/42256_2024_961_Fig1_HTML.png" alt="Learning from models beyond fine-tuning | Nature Machine Intelligence"></p> <p><img src="https://pic2.zhimg.com/v2-0b480ed523b38792e2b6a98c707cbd13_1440w.jpg" alt="无监督RL的粗略分析- 知乎"></p> <p><img src="https://www.mdpi.com/files/uploaded/covers/entropy/big_cover-entropy-v27-i3.png" alt="Entropy | March 2025 - Browse Articles"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*0TP8ZWb44Cy76nwGcP5KXA.png" alt="Important LLM Papers for the Week From 28/07 to 02/08 | by Youssef Hosni | Towards AI"></p> <p><img src="https://pbs.twimg.com/media/G2wLuJRaAAYNcHW.jpg" alt="Shamane Siri (@GShamane) / Posts / X"></p> <p><img src="https://www.pnas.org/cms/10.1073/pnas.2411909122/asset/f879855d-8990-40f4-a9e1-1c0946b86b92/assets/images/large/pnas.2411909122fig01.jpg" alt="Emergent neuronal mechanisms mediating covert attention in convolutional neural networks | PNAS"></p> <p><img src="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9c02ecf0-cb1d-4f62-a160-6d07636b99fd_1600x1384.png" alt="The State of Reinforcement Learning for LLM Reasoning"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!Vxu-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb229cbf-7e4c-4633-ae1f-50266d791abc_1600x1070.png" alt="The State of Reinforcement Learning for LLM Reasoning"></p> <p><img src="x-raw-image:///6247e25d801ce792c886171994eb1f52d66dba874818dc2962e6831fefd9e907" alt="Structure Trumps Size: Rethinking Data Quality for LLM Reasoning"></p> <p><img src="https://lookaside.fbsbx.com/lookaside/crawler/media/?media_id=2724692731200311" alt="Today I came across an interesting phenomenon in LLM reasoning on X. Some models actually switch languages mid-problem—not because they are confused, but because they are optimizing. Why? Because Chinese is computationally"></p> <p><img src="https://rlhfbook.com/c/images/rlvr-system.png" alt="Reasoning Training & Inference-Time Scaling | RLHF Book by Nathan Lambert"></p> <p><img src="https://www.cell.com/cms/10.1016/j.patter.2025.101370/asset/b2cd2e1d-a8de-4c73-aa45-a41dba78fde6/main.assets/gr6_lrg.jpg" alt="Toward large reasoning models: A survey of reinforced reasoning with large language models: Patterns"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/06/Screenshot-2025-06-18-at-1.14.09%E2%80%AFAM.png" alt="AREAL: Accelerating Large Reasoning Model Training with Fully Asynchronous Reinforcement Learning - MarkTechPost"></p> <p><img src="https://www.mdpi.com/entropy/entropy-27-00722/article_deploy/html/images/entropy-27-00722-g001.png" alt="Entropy-Regularized Attention for Explainable Histological Classification with Convolutional and Hybrid Models"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/the-entropy-mechanism-of-reinforcement-learning-for-reasoning-language-models-2.png" alt="Literature Review] The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models"></p> <p><img src="https://neurips.cc/media/PosterPDFs/NeurIPS%202025/119944.png?t=1765957629.3932862" alt="NeurIPS Poster Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?"></p> <p><img src="https://wires.onlinelibrary.wiley.com/cms/asset/085c6214-2ffd-4efa-a1bb-10749fab90e8/widm70038-toc-0001-m.jpg" alt="A Guide to Machine Learning Epistemic Ignorance, Hidden Paradoxes, and Other Tensions - Naser - 2025 - WIREs Data Mining and Knowledge Discovery - Wiley Online Library"></p> <p><img src="https://www.cell.com/cms/10.1016/j.patter.2025.101370/asset/bd68d24b-3810-438c-b433-0cc76f115834/main.assets/gr8_lrg.jpg" alt="Toward large reasoning models: A survey of reinforced reasoning with large language models: Patterns"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*5yklKWjrRKHoxSZT4jMiOw.png" alt="Important LLM Papers for the Week From 28/07 to 02/08 | by Youssef Hosni | Towards AI"></p> <p><img src="x-raw-image:///401bacec98b5cf9106cf452b41c78b2d3f75dfa9739eee401c7d3f694c0ad9f6" alt="Rethinking Entropy Interventions in RLVR: An Entropy Change Perspective"></p> <p><img src="https://media.licdn.com/dms/image/v2/D5605AQHiLWxk-Bb2-g/videocover-high/B56Zo4ldn2JQBU-/0/1761885936943?e=2147483647&v=beta&t=xGRrv9gTsfTeuE4MoDzvshZO5ajo1sGrbYyH_FZ7ALc" alt="Scalable MatMul-free Language Modeling: Ternary Weights and MLGRU | Devansh Devansh posted on the topic | LinkedIn"></p> <p><img src="https://miro.medium.com/1*2Czvl13aQL8zZZBAZNS-Eg.png" alt="Beyond the Mean: How Quantile Baselines Tame Entropy in AI Reasoning | by GlitchQ | Medium"></p> <p><img src="x-raw-image:///4f6c01e965e23115f581cf06c789aff966e98d6ebb0fa0b9a3bd3c7d6bd21476" alt="Tractable Regularization of Probabilistic Circuits"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/03/Screenshot-2025-03-29-at-7.01.48%E2%80%AFPM.png" alt="Advancing Medical Reasoning with Reinforcement Learning from Verifiable Rewards (RLVR): Insights from MED-RLVR - MarkTechPost"></p> <p><img src="https://raw.githubusercontent.com/natolambert/rlhf-book/main/assets/rlhf-book-share.png" alt="Reasoning Training & Inference-Time Scaling | RLHF Book by Nathan Lambert"></p> <p><img src="https://media.beehiiv.com/cdn-cgi/image/fit=scale-down,quality=80,format=auto,onerror=redirect/uploads/asset/file/3773f09d-d8db-48c2-87d6-0f0ee9331794/DeepSeek_training-2025-01-28-151508.png" alt="DeepSeek-R1 Special Edition"></p> <p><img src="x-raw-image:///8fc3be0fc099c45cea678bae7eb1faf2c5f35151895d0628830832805b498ea4" alt="RETHINKING ENTROPY REGULARIZATION IN LARGE REASONING MODELS"></p> <p><img src="https://i1.rgstatic.net/publication/395972622_Rethinking_Entropy_Regularization_in_Large_Reasoning_Models/links/68db6fa2d221a404b2a38959/largepreview.png" alt="PDF) Rethinking Entropy Regularization in Large Reasoning Models"></p> <p><img src="https://media.springernature.com/lw685/springer-static/image/art%3A10.1007%2Fs10994-018-5742-0/MediaObjects/10994_2018_5742_Fig1_HTML.png" alt="Rethinking statistical learning theory: learning using statistical invariants | Machine Learning"></p> <p><img src="x-raw-image:///309287d50b2c7989e82d04316d902e9633c0654e86024e1af0a64ea02f6c1f15" alt="Tractable Regularization of Probabilistic Circuits"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*o8GBZ2yCV5v7Zl4dmFN1bQ.png" alt="Important LLM Papers for the Week From 27/10 To 01/11 | by Youssef Hosni | Level Up Coding"></p> <p><img src="https://assets.pubpub.org/os112n3z/Updated%20Figure%201-11759437308940.png" alt="Multitask Learning 1997–2024: Part I Fundamentals · Issue 7.3, Summer 2025"></p> <p><img src="x-raw-image:///75d27427a658b4d4f86c6448d9213b5879944bcafbe913979132acd304b047fe" alt="Explore Data Left Behind in Reinforcement Learning for Reasoning Language Models"></p> <p><img src="https://quantumzeitgeist.com/wp-content/uploads/Image_fx-2026-01-15T080416.646.jpg" alt="Image Super-Resolution Achieves Efficiency Via Individualized Exploratory Attention, Rethinking Token Similarities"></p> <p><img src="https://dfzljdn9uc3pi.cloudfront.net/2024/cs-2016/1/fig-3-full.png" alt="Predictive maintenance in Industry 4.0: a survey of planning models and machine learning techniques [PeerJ]"></p> <p><img src="x-raw-image:///611aa78060593724ccafe8931b21f036593d5d7828eba46d9d42810fd3ae4d14" alt="Differential Smoothing Mitigates Sharpening and Improves LLM Reasoning"></p> <p><img src="x-raw-image:///2efc5f23ded66f70fd7104e2f9ae68781794de1d3d54dcc1c28d6ef5fdf32dee" alt="Rethinking KL Regularization in RLHF: From Value Estimation to Gradient Optimization"></p> <p><img src="https://pbs.twimg.com/media/G7-XjLnagAEgcbu.jpg" alt="Lokesh Mishra (@exomishra) / Posts / X"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/boosting-rl-based-visual-reasoning-with-selective-adversarial-entropy-intervention-3.png" alt="论文评述] Boosting RL-Based Visual Reasoning with Selective Adversarial Entropy Intervention"></p>]]></turbo:content>
</item><item turbo="true">
<turbo:extendedHtml>true</turbo:extendedHtml>
<link>https://servistopauto.ru/rethinking-models/187-Rethinking-machine-unlearning-for-large-language-models-99-sajin.html</link>
<author>trinitydoramy</author>
<category>rethinking models</category>
<pubDate>Thu, 19 Feb 2026 20:32:20 +0300</pubDate>
<turbo:content><![CDATA[<header><h1>Rethinking machine unlearning for large language models (99) 사진</h1></header><p><img src="https://blog.ml.cmu.edu/wp-content/uploads/2024/09/advcomp2.png" alt="Rethinking LLM Memorization – Machine Learning Blog | ML@CMU | Carnegie Mellon University"></p> <p><img src="x-raw-image:///975572770b42ca5699b76f76126d0f40d2e1e15b3bf34a180b17b3765ad2cc12" alt="Large Language Models for Recommendation: Progresses and Future Direction"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2024/09/Screenshot-2024-09-16-at-4.38.43-PM.png" alt="Rethinking LLM Training: The Promise of Inverse Reinforcement Learning Techniques - MarkTechPost"></p> <p><img src="https://www.mdpi.com/ai/ai-06-00108/article_deploy/html/images/ai-06-00108-g001.png" alt="Scrub-and-Learn: Category-Aware Weight Modification for Machine Unlearning"></p> <p><img src="https://icml.cc/media/PosterPDFs/ICML%202025/43469.png?t=1750994981.4391031" alt="ICML Poster Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond"></p> <p><img src="x-raw-image:///f13168603285830fde6165efb1e418b25314f65334094d93ef7971190c8b6b92" alt="Trustworthy LLM and an Unlearning Approach"></p> <p><img src="x-raw-image:///c4d6bc7502ac0bd96235ecca72652639df5c3ca52d9a38325a7605d16f13b26c" alt="LLM Unlearning on Noisy Forget Sets: A Study of Incomplete, Rewritten, and Watermarked Data"></p> <p><img src="https://csdl-images.ieeecomputer.org/trans/tq/2023/04/figures/liu8-3194884.gif" alt="Learn to Forget: Machine Unlearning via Neuron Masking"></p> <p><img src="x-raw-image:///d62f015c5537611556cef61082d8240a25d93b4347324fa539dfc14771316caa" alt="Machine Unlearning"></p> <p><img src="https://i-blog.csdnimg.cn/direct/ce0c477c51ef4e16898b0f5aa6166843.jpeg" alt="A CLOSER LOOK AT MACHINE UNLEARNING FOR LARGE LANGUAGE MODELS-CSDN博客"></p> <p><img src="https://lookaside.instagram.com/seo/google_widget/crawler/?media_id=3755601916931930822" alt="In December of 2024, Anthropic released a research paper about a new LLM phenomenon called Alignment Faking. Large Language Models (LLMs) trained with Reinforcement Learning from Human Feedback (RLHF) might fake alignment"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs10462-024-11078-6/MediaObjects/10462_2024_11078_Fig5_HTML.png" alt="Digital forgetting in large language models: a survey of unlearning methods | Artificial Intelligence Review | Springer Nature Link"></p> <p><img src="x-raw-image:///ea9c3d3bcc59558ed94a5b443bab7e796691d6232e4b975a396cb4394b981861" alt="Machine Unlearning"></p> <p><img src="https://csdl-images.ieeecomputer.org/trans/tq/2023/04/figures/liu14-3194884.gif" alt="Learn to Forget: Machine Unlearning via Neuron Masking"></p> <p><img src="x-raw-image:///3aef51db767ca4a0c86e24fdb672be1a2d6f77067350aceb9bb87e6818d35bdf" alt="A Survey on Unlearning in Large Language Models"></p> <p><img src="x-raw-image:///61a648e3840795352500183a2f1461cdbb92bffe4f87683dbbcb50c780c192d3" alt="Unlearn What You Want to Forget: Efficient Unlearning for LLMs"></p> <p><img src="x-raw-image:///e8b280d597464dbab0e36737a211251429c4e6316a32689f853cbc6bbdeec013" alt="Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate"></p> <p><img src="x-raw-image:///43d27ef15729085db31d36b24c06f3fefd2a60d1153abdd4e8b2c93154982ff0" alt="A General Framework to Enhance Fine-tuning-based LLM Unlearning"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1057%2Fs41599-025-06004-2/MediaObjects/41599_2025_6004_Fig1_HTML.png" alt="Knowledge-enhanced large language models for automatic lesson plan generation | Humanities and Social Sciences Communications"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs10462-024-11078-6/MediaObjects/10462_2024_11078_Fig4_HTML.png" alt="Digital forgetting in large language models: a survey of unlearning methods | Artificial Intelligence Review | Springer Nature Link"></p> <p><img src="https://cdn.bytez.com/mobilePapers/v2/icml/43469/images/12-0.png" alt="ICML Poster Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond"></p> <p><img src="https://cdn-uploads.huggingface.co/production/uploads/63d10d4e8eaa4831005e92b5/f_gIDHqHc6NftqDmsrvsK.png" alt="Paper page - CLEAR: Character Unlearning in Textual and Visual Modalities"></p> <p><img src="https://jinghanjia.netlify.app/assets/img/publication_preview/wagle.png" alt="Jinghan Jia"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/simplicity-prevails-rethinking-negative-preference-optimization-for-llm-unlearning-1.png" alt="Literature Review] Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/muse-machine-unlearning-six-way-evaluation-for-language-models-1.png" alt="Literature Review] MUSE: Machine Unlearning Six-Way Evaluation for Language Models"></p> <p><img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4fe2d8fa1daf2a48b45357bb0648d5b7a93dc279e1cec32a8bd1f0061ac98680/csdah2jpcxqts4oyo7zd" alt="Machine Unlearning in Large Language Models: A Comprehensive Survey with Empirical Insights from the Qwen 1.5 1.8B Model — LessWrong"></p> <p><img src="x-raw-image:///e76d4d46c1c14578150f5d10ff7673076d5a3b574013e2919a1749bf649a1c5e" alt="Large Language Model Unlearning"></p> <p><img src="x-raw-image:///838e9cbe2b58e96569c9b549754264e16276735bf97c102f814c88863c6b038d" alt="Jogging the Memory of Unlearned LLMs Through Targeted Relearning Attacks"></p> <p><img src="x-raw-image:///8925c53478a2def9aa2ed6c9636b0d8217fcd54eabd621b562cb4d4d7ec6f0d3" alt="Rethinking Data Use in Large Language Models"></p> <p><img src="https://imgv2-1-f.scribdassets.com/img/document/986270915/original/c6ab9e3298/1?v=1" alt="Rethinking machine unlearning for large language models by Sijia Liu & Yuanshun Yao & Jinghan Jia & Stephen Casper & Nathalie Baracaldo & Peter Hase & Yuguang Yao & Chris Yuhao Liu"></p> <p><img src="https://pub.mdpi-res.com/ai/ai-06-00108/article_deploy/html/images/ai-06-00108-g006.png?1747991792" alt="Scrub-and-Learn: Category-Aware Weight Modification for Machine Unlearning"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs10462-025-11376-7/MediaObjects/10462_2025_11376_Fig2_HTML.png" alt="A survey on large language models unlearning: taxonomy, evaluations, and future directions | Artificial Intelligence Review | Springer Nature Link"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1038%2Fs41598-025-98483-1/MediaObjects/41598_2025_98483_Fig3_HTML.png" alt="Industrial applications of large language models | Scientific Reports"></p> <p><img src="https://ai2-s2-public.s3.amazonaws.com/figures/2017-08-08/2193d01019909565a04110605b326e393d1c4158/43-Table7-1.png" alt="PDF] RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models | Semantic Scholar"></p> <p><img src="https://i1.rgstatic.net/publication/392371095_Rethinking_Post-Unlearning_Behavior_of_Large_Vision-Language_Models/links/683fb8b9df0e3f544f5ccd80/largepreview.png" alt="PDF) Rethinking Post-Unlearning Behavior of Large Vision-Language Models"></p> <p><img src="x-raw-image:///fdf58e079815f800ebd44052298df24f5dbbc627bb89c247a3ab5831bedfc965" alt="Rethinking Data Use in Large Language Models"></p> <p><img src="https://ai2-s2-public.s3.amazonaws.com/figures/2017-08-08/ec072b1382e8da7480737a8cc5b9ca98dda30c9a/7-Figure2-1.png" alt="PDF] Machine Unlearning of Pre-trained Large Language Models | Semantic Scholar"></p> <p><img src="https://cdn-uploads.huggingface.co/production/uploads/60cd95ee15ecba5f2200304a/IjYMj0aoU9kPusFvLi7CW.jpeg" alt="Paper page - CLEAR: Character Unlearning in Textual and Visual Modalities"></p> <p><img src="https://www.mdpi.com/ai/ai-06-00108/article_deploy/html/images/ai-06-00108-g002.png" alt="Scrub-and-Learn: Category-Aware Weight Modification for Machine Unlearning"></p> <p><img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/ubdaBnhyhtkxYmmRt/wauukkqgcczjpudyscsd" alt="Machine Unlearning in Large Language Models: A Comprehensive Survey with Empirical Insights from the Qwen 1.5 1.8B Model — LessWrong"></p> <p><img src="https://icml.cc/media/PosterPDFs/ICML%202025/44738.png?t=1751559406.580015" alt="ICML Poster Flexible, Efficient, and Stable Adversarial Attacks on Machine Unlearning"></p> <p><img src="https://simg.baai.ac.cn/papers/converted_page_cd4aa02babf0e306f51e89ebd01c1074-02.jpg?x-oss-process=image/format,jpg/interlace,1" alt="Rethinking Machine Unlearning for Large Language Models - 智源社区论文"></p> <p><img src="x-raw-image:///5143483e8d7ff11e4b18b0b22db05af6ef3e8ebba4dc59503368b01b46127b79" alt="Towards Efficient Machine Unlearning with Data Augmentation: Guided Loss-Increasing (GLI) to Prevent the Catastrophic Model Util"></p> <p><img src="x-raw-image:///18a0f66e2e8a32dbf1b378a45a727282c2f804b3cb0fc16a41e69e3c9d5021be" alt="Decoupling Memories, Muting Neurons: Towards Practical Machine Unlearning for Large Language Models"></p> <p><img src="x-raw-image:///4c0686f7167db9e9c191cf4b2d2205305cccb9d27bb092992eacc87a990e9741" alt="SoK: Machine Unlearning for Large Language Models"></p> <p><img src="https://miro.medium.com/v2/resize:fit:1400/1*TIutJBjjdT0DkgiwmgpiVQ.png" alt="Learning to Unlearn: Why Data Scientists and AI Practitioners Should Understand Machine Unlearning | by Raul Vizcarra Chirinos | TDS Archive | Medium"></p> <p><img src="x-raw-image:///7cf333096bb6870f28fe57c6afe24e94d83e186c8c30562c51bcfd225ae70d85" alt="Trustworthy LLM and an Unlearning Approach"></p> <p><img src="https://pic2.zhimg.com/v2-44697ba768a7e9189bb9efa7d787dda9_1440w.jpg" alt="爱可可AI 前沿推介(2.16) - 知乎"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/obliviate-robust-and-practical-machine-unlearning-for-large-language-models-1.png" alt="Literature Review] OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models"></p> <p><img src="x-raw-image:///9f4b7cc7995857dac64150c978be6c4eb43e377802bd6a66beb874533e75fbca" alt="Decoupling Memories, Muting Neurons: Towards Practical Machine Unlearning for Large Language Models"></p> <p><img src="https://i.ytimg.com/vi/RiKhwbINXgE/maxresdefault.jpg" alt="Can LLMs Unlearn? Part 4: The Technical Complexity of it All (continued)"></p> <p><img src="x-raw-image:///51ff2780e95ab4c6aa6dd18b7627108d5218763da2fe48efe8a88517c2e8bfff" alt="LLM Unlearning via Neural Activation Redirection"></p> <p><img src="x-raw-image:///bbb4d52c46ec49f2ba92c769790c8dcb95acf7bc8ad1ff786689d375d98a8d88" alt="Hierarchical Federated Unlearning for Large Language Models"></p> <p><img src="https://miro.medium.com/0*25o-i-Q8CdKyVUpz" alt="Learning to Unlearn: Why Data Scientists and AI Practitioners Should Understand Machine Unlearning | by Raul Vizcarra Chirinos | TDS Archive | Medium"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/05/Screenshot-2025-05-22-at-11.36.50%E2%80%AFAM.png" alt="Beyond Aha Moments: Structuring Reasoning in Large Language Models - MarkTechPost"></p> <p><img src="x-raw-image:///6006f00687f5c491799810cca82103aefe0ca359f31841f1f089d0b0835e1c6b" alt="RETHINKING LLM UNLEARNING OBJECTIVES: A GRADIENT PERSPECTIVE AND GO BEYOND"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41524-025-01564-y/MediaObjects/41524_2025_1564_Fig1_HTML.png" alt="Fine-tuning large language models for domain adaptation: exploration of training strategies, scaling, model merging and synergistic capabilities | npj Computational Materials"></p> <p><img src="https://towardsdatascience.com/wp-content/uploads/2024/08/1HjMQTEelpNHUPIio70rYkA.png" alt="Learning to Unlearn: Why Data Scientists and AI Practitioners Should Understand Machine Unlearning | Towards Data Science"></p> <p><img src="x-raw-image:///f4c3d512c17e77433ae438132052d56cfe9eca58cd7d9834e2ef672abcd76287" alt="LLM Unlearning via Neural Activation Redirection"></p> <p><img src="x-raw-image:///7658d6177a9aedd510dd2178659b1269823075fd64293a4500bcac7da6259839" alt="RETHINKING LLM UNLEARNING OBJECTIVES: A GRADIENT PERSPECTIVE AND GO BEYOND"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs41598-025-98483-1/MediaObjects/41598_2025_98483_Fig1_HTML.png" alt="Industrial applications of large language models | Scientific Reports"></p> <p><img src="x-raw-image:///e38d52153a5e60d2a2ba2487392b26398425fe94843caf95179cbb14878734f6" alt="Decoupling Memories, Muting Neurons: Towards Practical Machine Unlearning for Large Language Models"></p> <p><img src="x-raw-image:///8dc1279416b758aca75a09a135d6f8e35a8af7a8080f53fd74f259f68c9d6a63" alt="LLM Unlearning with LLM Beliefs"></p> <p><img src="https://ai2-s2-public.s3.amazonaws.com/figures/2017-08-08/03e6d456b7dd9be6d95276b8ccfa9bcb775f27ff/3-Figure2-1.png" alt="PDF] RKLD: Reverse KL-Divergence-based Knowledge Distillation for Unlearning Personal Information in Large Language Models | Semantic Scholar"></p> <p><img src="https://onlinelibrary.wiley.com/cms/asset/dd87dc52-27c5-4bbf-a8a7-6de4468791e6/aaai12209-fig-0003-m.jpg" alt="A survey of security and privacy issues of machine unlearning - Chen - 2025 - AI Magazine - Wiley Online Library"></p> <p><img src="https://media.springernature.com/full/springer-static/image/art%3A10.1038%2Fs43588-025-00768-y/MediaObjects/43588_2025_768_Fig1_HTML.png" alt="Harnessing large language models for data-scarce learning of polymer properties | Nature Computational Science"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2025/03/Screenshot-2025-03-26-at-10.17.37%E2%80%AFPM.png" alt="Vision-R1: Redefining Reinforcement Learning for Large Vision-Language Models - MarkTechPost"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/offset-unlearning-for-large-language-models-2.png" alt="Literature Review] Offset Unlearning for Large Language Models"></p> <p><img src="https://blog.ml.cmu.edu/wp-content/uploads/2022/04/Screenshot-2022-04-01-at-16.14.50.png" alt="Rethinking LLM Memorization – Machine Learning Blog | ML@CMU | Carnegie Mellon University"></p> <p><img src="x-raw-image:///c5ba0266c38e1f0df47ba3a4406725c5b3b871746d57ab6c166647b9c5e3973d" alt="Trustworthy LLM and an Unlearning Approach"></p> <p><img src="x-raw-image:///c4d8944dc1bea6956ccb6e5e4f23ec4e1c4e4eb82bde777797a6df0f9e8057de" alt="A General Framework to Enhance Fine-tuning-based LLM Unlearning"></p> <p><img src="https://ai2-s2-public.s3.amazonaws.com/figures/2017-08-08/2193d01019909565a04110605b326e393d1c4158/10-Figure5-1.png" alt="PDF] RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models | Semantic Scholar"></p> <p><img src="x-raw-image:///718febf488330186fc0e2def894fc27a10c1c3f335c56dfa0313701ff7e790b0" alt="Large Language Models for Recommendation: Progresses and Future Direction"></p> <p><img src="https://imgopt.infoq.com/fit-in/3000x4000/filters:quality(85)/filters:no_upscale()/news/2025/01/large-language-models-scheming/en/resources/1scheming-0-1737103555438.jpg" alt="Major LLMs Have the Capability to Pursue Hidden Goals, Researchers Find - InfoQ"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs11831-025-10436-z/MediaObjects/11831_2025_10436_Fig1_HTML.png" alt="Machine Unlearning for Trustworthy AI: A Systematic Review of Techniques, Challenges, and Applications | Archives of Computational Methods in Engineering | Springer Nature Link"></p> <p><img src="https://ik.imagekit.io/edtechdigit/usaii/content/images/rethinking-ai-adoption-strategic-lessons-from-five-enterprise-level-failures.png" alt="AI Blogs | AI Insights | AI Trends | AI Articles | AI News"></p> <p><img src="https://i1.rgstatic.net/publication/392597840_SoK_Machine_Unlearning_for_Large_Language_Models/links/684a62c4d0be921dfef711a0/largepreview.png" alt="PDF) SoK: Machine Unlearning for Large Language Models"></p> <p><img src="https://swj0419.github.io/detect-pretrain.github.io/static/images/intro2.png" alt="Detecting Pretraining Data from Large Language Models"></p> <p><img src="https://media.springernature.com/lw1200/springer-static/image/art%3A10.1007%2Fs11831-025-10436-z/MediaObjects/11831_2025_10436_Fig4_HTML.png" alt="Machine Unlearning for Trustworthy AI: A Systematic Review of Techniques, Challenges, and Applications | Archives of Computational Methods in Engineering | Springer Nature Link"></p> <p><img src="https://res.cloudinary.com/lesswrong-2-0/image/upload/f_auto,q_auto/v1/mirroredImages/4b76a736e1a3d7727f34ad4003fb62529583993d3ed86ccccf092f3bf5d0da3b/pnr78u2prtslcqkq7gqg" alt="Machine Unlearning in Large Language Models: A Comprehensive Survey with Empirical Insights from the Qwen 1.5 1.8B Model — LessWrong"></p> <p><img src="https://i1.rgstatic.net/publication/380757333_Single_Image_Unlearning_Efficient_Machine_Unlearning_in_Multimodal_Large_Language_Models/links/664d657d0b0d2845744ea7dd/largepreview.png" alt="PDF) Single Image Unlearning: Efficient Machine Unlearning in Multimodal Large Language Models"></p> <p><img src="https://figures.semanticscholar.org/d0b02eb4f0d3efd884b49450efc88145bdf49abc/4-Table1-1.png" alt="PDF] Rethinking machine unlearning for large language models | Semantic Scholar"></p> <p><img src="x-raw-image:///413a82b4549e5f2644b0e61d04c6f8c5ece09792b92291ab4521d1562c028745" alt="MITIGATING MEMORIZATION IN LANGUAGE MODELS"></p> <p><img src="https://lookaside.fbsbx.com/lookaside/crawler/media/?media_id=4071512173120505" alt="Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling (Shanghai AI Laboratory, February 2025) Paper: [https://arxiv.org/abs/2502.06703](https://arxiv.org/abs/2502.06703) Abstract: "Test-Time Scaling (TTS) is an important method for ..."></p> <p><img src="https://arxiv.org/html/2406.15796v1/x1.png" alt="Rethinking Entity-level Unlearning for Large Language Models | AI Research Paper Details"></p> <p><img src="https://pbs.twimg.com/media/GGV94YkaUAAtAF4.jpg" alt="🚨 Important + fun collaboration on this new position paper ➡️➡️ "Rethinking Machine Unlearning for Large Language Models": ▪️ Eliminate undesirable data influence (e.g., sensitive/illegal info), while maintaining integrity of essential knowledge"></p> <p><img src="https://csdl-images.ieeecomputer.org/trans/tq/2023/04/figures/liu5-3194884.gif" alt="Learn to Forget: Machine Unlearning via Neuron Masking"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/editing-as-unlearning-are-knowledge-editing-methods-strong-baselines-for-large-language-model-unlearning-1.png" alt="Literature Review] Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning?"></p> <p><img src="https://i1.rgstatic.net/publication/396629089_The_gains_do_not_make_up_for_the_losses_a_comprehensive_evaluation_for_safety_alignment_of_large_language_models_via_machine_unlearning/links/68f3fe8d7d9a4d4e870ac476/largepreview.png" alt="PDF) The gains do not make up for the losses: a comprehensive evaluation for safety alignment of large language models via machine unlearning"></p> <p><img src="https://substackcdn.com/image/fetch/$s_!7Dqg!,f_auto,q_auto:best,fl_progressive:steep/https%3A%2F%2Fcareylening.substack.com%2Fapi%2Fv1%2Fpost_preview%2F151640738%2Ftwitter.jpg%3Fversion%3D4" alt="Can LLMs Unlearn? Part 4: The Technical Complexity of it All (continued)"></p> <p><img src="x-raw-image:///8c15ffebed13961663631f62e7ce3788350b97446ef04ac335346e4244128831" alt="A Survey of Machine Unlearning Author Published Journal Title Version DOI Rights statement Downloaded from Funder(s) Grant ident"></p> <p><img src="https://dl.acm.org/cms/asset/81f43156-04f2-4bbc-b65f-06952925fa20/3746252.3761379.key.jpg" alt="Efficient Knowledge Graph Unlearning with Zeroth-order Information | Proceedings of the 34th ACM International Conference on Information and Knowledge Management"></p> <p><img src="https://www.mdpi.com/ai/ai-06-00108/article_deploy/html/images/ai-06-00108-g003.png" alt="Scrub-and-Learn: Category-Aware Weight Modification for Machine Unlearning"></p> <p><img src="https://imgv2-2-f.scribdassets.com/img/document/974072258/original/b3125f8b39/1?v=1" alt="Jeena NLP Project | PDF | Cognitive Science"></p> <p><img src="https://www.marktechpost.com/wp-content/uploads/2024/06/Screenshot-2024-06-16-at-8.53.00-AM.png" alt="Innovative Approaches in Machine Unlearning: Insights and Breakthroughs from the first NeurIPS Unlearning Competition on Efficient Data Erasure - MarkTechPost"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/rethinking-machine-unlearning-in-image-generation-models-4.png" alt="论文评述] Rethinking Machine Unlearning in Image Generation Models"></p> <p><img src="x-raw-image:///bab4914b57fceb3f6d2c137cea0e8781d892706c0f2fa2660dc3ae139f20dc4c" alt="A General Framework to Enhance Fine-tuning-based LLM Unlearning"></p> <p><img src="https://moonlight-paper-snapshot.s3.ap-northeast-2.amazonaws.com/arxiv/machine-unlearning-in-large-language-models-3.png" alt="论文评述] Machine Unlearning in Large Language Models"></p>]]></turbo:content>
</item></channel></rss>