What Are Auto-regressive Models? A Deep Dive and Typical Use Cases
blog.pangeanic.com·1d
🤖Software Engineering with AI
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·7h
🤖Software Engineering with AI
Spatial Secrets: Unleashing Language Models with Unexpected Masking by Arvind Sundararajan
dev.to·7h·
Discuss: DEV
🧬Computational Neuroscience
Fast, Scalable LDA in C++ with Stochastic Variational Inference
github.com·21h·
Discuss: r/cpp
🤖Software Engineering with AI
Writing an LLM from scratch, part 27 – what's left, and what's next?
gilesthomas.com·12h·
Discuss: Hacker News
🤖Software Engineering with AI
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example)
nbtab.com·3h·
Discuss: DEV
🤖Software Engineering with AI
An open dataset of Chinese duration expressions
nature.com·22h
🧬Computational Neuroscience
Hybrid-Attention models are the future for SLMs
inference.net·10h·
Discuss: Hacker News
🧬Computational Neuroscience
Post-training methods for language models
developers.redhat.com·5h
🤖Software Engineering with AI
Reversal Invariance in Autoregressive Language Models
arxiv.org·7h
🧬Computational Neuroscience
[P] triplet-extract: GPU-accelerated triplet extraction via Stanford OpenIE in pure Python
reddit.com·10h
🧬Computational Neuroscience
Prompt Injection as an Emerging Threat: Evaluating the Resilience of Large Language Models
arxiv.org·7h
🤖Software Engineering with AI
Automating error analysis for AI agents – what works and doesn't
atla-ai.com·2h·
Discuss: Hacker News
🤖Software Engineering with AI
Explore More, Learn Better: Parallel MLLM Embeddings under Mutual Information Minimization
arxiv.org·7h
🧬Computational Neuroscience
ParaScopes: What do Language Models Activations Encode About Future Text?
arxiv.org·7h
🤖Software Engineering with AI
Benchmarking Large Language Models and Privacy Protection
priv.gc.ca·1d
🤖Software Engineering with AI
Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs
paperium.net·22h·
Discuss: DEV
🤖Software Engineering with AI