inarcissuss's Feed

Use TurboQuant-compatible GGUF models on Windows with LM Studio, Ollama for Windows, and llama.cpp. Covers hardware requirements, which models support TQ4_K_M, and the best current approximation while native TurboQuant rolls out. Read more ›
With a great number of tokens comes great responsibility Read more ›
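A so-called "distillation attack," also known as a model extraction attack, typically refers to an AI company or its affiliates repeatedly calling a competitor's model through its public API and using the outputs as training data to train a model of their own, thereby copying the competitor's core capabilities without authorization. It is regarded as a new form of intellectual property theft. In a June 10 letter to U.S. Senators Tim Scott and Elizabeth Warren, Anthropic stated that the aim of Alibaba's actions was to "illicitly extract Claude's capabilities" and help its smaller, weaker internal models give higher-quality answers across a broad range of questions. The letter also noted that these attacks specifically targeted Anthropic's frontier model "Mythos Preview," which the company regards as a major upgrade over earlier Claude Opus models, with stronger performance particularly in programming and cybersecurity. Anthropic further claimed that the Chinese government "bears complicit responsibility" in this process, and placed it within China's overall strategy of seeking global dominance in artificial intelligence, machine learning, and related technologies. The company warned that if such attacks succeed, they could pose an "existential threat" to the long-term technological competitiveness and security interests of the United States and its allies. The recipients of the letter, Scott and W... Read more ›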
Notion's new Claude agents bring AI directly into your workspace, letting teams write, code, and manage projects. Read more ›
A first step into picoCTF binary exploitation, where spamming A’s somehow became a legitimate strategy. Read more ›
One of the greatest weaknesses of AI agents that read and understand vast amounts of enterprise data is "hallucination"—the generation of plausible-sounding but factually incorrect information. KAIST researchers have developed a next-generation database technology capable of understanding documents, data and relationships among entities all at once. Read more ›
AI agents are a growing priority for enterprises, with many companies interested in deploying them for a wide range of purposes, from software development and marketing to sales and customer support. Most discussions revolve around single AI agents. However, Gartner has seen a 1,445% surge in inquiries about multi-agent systems (MAS) from Q1 2024 to... Read more ›
Empowering everyone to build reliable and efficient software. Read more ›
Series: Fine-Tuning, Smallest to Largest. QLoRA (7B) ← you are here. In an earlier part, LoRA let me fine-tune a 1.5B model by freezing it and training tiny adapters. But the frozen base still sat in memory in 16-bit (~3GB). Now I wanted to go to Qwen2.5-7B, and hit a wall that LoRA alone doesn't solve. The problem: A 7B model is ~15GB in 16-bit precision. A free-tier T4 GPU has 16GB. It would barely load, with no room left to actually train. The QLoRA insight: QLoRA asks the question that naturally follows ... Read more ›
Discussed on DEV
🤖 AI/ML · medium.com
If you are learning Machine Learning, you have probably lived this exact scenario: You spend hours cleaning a dataset, you build a PyTorch… Read more ›
Author(s): Ayo Akinkugbe. Originally published on Towards AI. Understanding Reinforcement Learning — A Primer. Introduction: Learning by Trial and Error. Imagine teaching a dog to fetch a ball. You don’t hand the dog a manual titled “The Complete Guide to Ball Retrieval.” Instead, you throw the ball, and when the dog brings it back, you give it a treat. When the dog gets distracted and wanders off, you withhold the treat. Over dozens of repetitions, the dog... Read more ›
Responsible AI is becoming infrastructure for AI agents: runtime controls, system accountability, human oversight, and safeguards for tools that act. Read more ›
Large Language Models (LLMs) achieve strong performance across a growing range of domains, yet their scale poses deployment challenges in applications where latency and cost constraints are critical. This paper derives empirical scaling laws for domain-specific LLM compression, quantifying how in-domain and general knowledge performance scale with dataset size, compression ratio, supervision format, and iterative pruning schedule. Using quantita... Read more ›
See how Microsoft is transforming software development with agentic workflows, AI-powered automation, and specialized agents across the engineering lifecycle. Read more ›
# Agent Skills Agent Skills are modular capabilities that extend Claude's functionality. Each Skill packages instructions, metadata, and optional resources (scripts, templates) that Claude uses automatically when relevant. --- This feature is **not** eligible for Zero Data Retention (ZDR). Data is retained according to the feature's standard retention policy. ## Why use Skills Skills are reusable, filesystem-based resources that provide Claude with domain-specific expertise: workflows,... Read more ›
AI-Gateway reverse proxy that uses semantic caching and aims to reduce LLM API bills and token costs by 40-70%. - Arnab758/ai-gateway Read more ›
Discussed on Hacker News
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems. Read more ›
🐛 Bug Bounty · medium.com
Last year, we shared a behind-the-scenes look at how Agoda runs its bug bounty program, the lessons learned, the challenges of striking… Read more ›
As I continue learning AI Engineering, I keep discovering something surprising: building AI applications is often less about training… Read more ›
Note-taking has always been a difficult task for me. I would rarely write notes and, when I did write some, they would almost certainly never be reviewed. Although there is still some value in writing throwaway notes just for the sake of writing, I wanted to look for a... Read more ›
Discussed on Hacker News