SiPeR
“The code and data are available at https://github.com/DongdingLin/SiPeR.”
What the smart people are recommending. 7865 books, tools, and products endorsed by the thinkers absorb.md tracks. Ranked by how many times each has been recommended across compiled podcasts, papers, posts, and tweets.
“Code and optimized prompts are available at https://github.com/TUMLegalTech/icail2026-llm-judge-gaming.”
“We systematically address these questions on the LEXam benchmark”
“The dataset is available at https://huggingface.co/datasets/google/RSRCC.”
“confirming Chorus as a practical tool for generating high-quality deliberation data suitable for online discourse analysis”
“The framework was deployed on the Deliberate platform”
“An accompanying open-source interactive tool, the Co-creation Provenance Lab, enables policymakers to audit and iteratively improve summaries, establishing genuine human-in-the-loop oversight at scale…”
“The code related to this work is available at https://github.com/zwhong714/Hybrid-Policy-Distillation.”
“The source code of this paper was available at: https://anonymous.4open.science/r/MSR-MEL-C21E/.”
“we instantiate this paradigm via EVIAN (Explainable Visual Instruction-tuning Data AuditiNg), an automated framework that evaluates these components along the orthogonal axes of Image-Text Consistency…”
“This paper proposes LayerTracer, an architecture-agnostic end-to-end analysis framework compatible with any LLM architecture.”
“The source code and dataset used in this paper are publicly available on Github repository: https://github.com/ChenShuai00/MAGenIdeas.”
“The demo is available at https://huggingface.co/spaces/cshuai20/MAGenIdeas.”
“Source code is available at: https://github.com/ErrEqualsNil/HaS.”
“Code, data and statistical scripts are available at https://github.com/julia-nixie/ConceptFrameMet.”
“The dataset is available at: https://github.com/slanglab/RespondeoQA”
“Code is available at https://github.com/balaboom123/signdata-slt.”
“Our project page is at https://ucsc-vlaa.github.io/AgentPressureBench .”
“The New York Times story: https://www.nytimes.com/2024/12/12/technology/ev-williams-twitter-medium-mozi.html?smid=nytcore-ios-share&referringSource=articleShare”
“To bridge this gap, we propose Language-Agnostic Utility-driven Reranker Alignment (LAURA), which aligns multilingual evidence ranking with downstream generative utility.”
“The items, per-model responses and complete leaderboard are published as a browsable web interface at https://actubench.de/en/, allowing readers and practitioners to inspect individual items without a…”
“We release the benchmark (https://github.com/lunyiliu/GaoYao).”
“We fill the gap with cukereuse, an open-source Python CLI combining exact hashing, Levenshtein ratio, and sentence-transformer embeddings in a layered pipeline, released alongside an empirical corpus …”
“Scott MacDonald speaks with WCM’s Sloane Payne and Dave Joerger on scaling culture and preserving identity through growth. https://www.capitalallocators.com/podcast/wcm-investment-management/”
“V6 from @PixVerse_ is now live on Replicate.”
“Try now: https://replicate.com/pixverse/pixverse-v6”
“Code & models will be public at https://anticdimi.github.io/lexis.”
“Code is available at https://github.com/visinf/MARCO .”
“Claire used Claude Design to redesign my newsletter. I think it's a winner. https://lennys-product-zoneee.vercel.app/”
“GPT Image 2 from @openai is now on Replicate. Killer photorealism and crisp text rendering with strong adherence to layouts, UI, and design use cases.”
“Open weights. Agentic coding that actually finishes.”
“This paper presents F²LP-AP (Fast and Flexible Label Propagation with Adaptive Propagation Kernel), a training-free, computationally efficient framework...”
“Read our research: https://research.perplexity.ai/articles/advancing-search-augmented-language-models”
“AI natives can now use Kimi K2.6 on Together AI and benefit from reliable inference for production-scale autonomous agent workflows.”
“How to Draw a Candlestick Chart in R? – Both ggplot2 and plotly”
“Finally, someone is doing for emotional intelligence what has already happened for “exercise”: Dr Brackett delineates the clear actionable steps (protocols) to take in real time to develop these skill…”
“We’re also introducing GPT-5.5 Pro for Pro, Business, and Enterprise users in ChatGPT.”
“Hah, I ended up buying the V8 mentioned in that thread”
“later managed to track down a VR1280!”
“Deep Agents Deploy: an open alternative to Claude Managed Agents”
“Dive into the technical details → https://deepmind.google/blog/decoupled-diloco/?utm_source=x&utm_medium=social&utm_campaign=&utm_content=”
“It builds on 2️⃣ earlier advances: Pathways: an AI system that connects different computer chips...”
“DiLoCo: an approach to minimize the bandwidth needed across distributed centers.”
“Together as Decoupled DiLoCo, it can tackle the key challenge of training at scale.”
“Read more on @FastCompany → https://t.co/BNm6gLWS4W”
“We introduce Auto-ART, an open-source framework that operationalizes identified gaps: 50+ attacks, 28 defense modules, the Robustness Diagnostic Index (RDI), and gradient-masking detection.”
“I endorse everything in this manifesto.”
“My digital twin is getting good. Thanks @blevlabs !”
“some more references in footnotes https://www.swyx.io/decade”
“New Huberman Lab Essentials episode out now: 30 minutes, key takeaways only.”
“New Huberman Lab Essentials episode out now: 30 minutes, key takeaways only. @erichjarvis”
“Listen to Cat”
“Lumos is an online debugging framework that exposes application-level bug provenances... Lumos provides developers with enough evidence to identify a bug's root cause, while incurring low runtime over…”
“My first two TiKZ Sparks unicorns from DeepSeek v4.”
“My first two TiKZ Sparks unicorns from DeepSeek v4. (Expert mode, from the DeepSeek site, which is supposed to be v4 Pro according to the release)”
“Love using auto-review. It’s my new default mode”
“Every university needs a @mroth78 as president.”
“Gift link: Yale Has Come Up With a Surefire Way to Make a Terrible Situation Worse https://t.co/LN1OWwM5hX”
“DeepSeek V4 Pro is now available on Together AI. DeepSeek V4 Flash coming soon. Try it now: https://www.together.ai/models/deepseek-v4-pro”
“Might be a good project to build with Hermes/OpenClaw.”
“Me? I made an AI to help me see the AI community here on X in a new way: https://t.co/8L5xphk0qQ which helps me see the whales and the small accounts. So I know it is possible.”
“Talked to @ListenLabs co-founder + CTO @florian_jue in the latest Max Agency.”
“Come join us! https://www.langchain.com/careers”
“In this work, we present SCALA (Signaling CA with Local Attraction), a novel non-hierarchical cellular automaton decoder for quantum repetition and toric codes.”
“By evaluating SCALA alongside the hierarchical CA decoder proposed by Harrington, we provide a direct comparison between non-hierarchical and renormalization-group-style local decoding strategies.”
“We also introduce the QFrame python library, which is used to automate the construction of quantum circuits that represent reciprocal transforms.”
“We present pygridsynth, an open-source Python library for ancilla-free approximate Clifford+$T$ synthesis... Taken together, these features make pygridsynth a Python-native platform for high-precision…”
“we propose StructMem, a structure-enriched hierarchical memory framework that preserves event-level bindings and induces cross-event connections. By temporally anchoring dual perspectives and performi…”
“see https://github.com/zjunlp/LightMem”
“this work introduces A-THENA, a lightweight early intrusion detection system (EIDS) that significantly extends preliminary findings on time-aware encodings.”
“we introduce TEmBed, the Tabular Embedding Test Bed, a comprehensive benchmark for systematically evaluating tabular embeddings across four representation levels: cell, row, column, and table.”
“Code and data: https://github.com/WujiangXu/AEL.”
“Our code is publicly available at https://github.com/mikumifa/GS-Quant.”
“Our code is available at https://github.com/baowenxuan/Ramen .”
“To support reproducibility and further research, we will publicly release our evaluation benchmark, preference training dataset, and code at https://pegah-kh.github.io/projects/prompts-override-vision…”