What Are Auto-regressive Models? A Deep Dive and Typical Use Cases
blog.pangeanic.com·1d
🤖Software Engineering with AI
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
arxiv.org·7h
🤖Software Engineering with AI
Spatial Secrets: Unleashing Language Models with Unexpected Masking by Arvind Sundararajan
🧬Computational Neuroscience
Fast, Scalable LDA in C++ with Stochastic Variational Inference
🤖Software Engineering with AI
Writing an LLM from scratch, part 27 – what's left, and what's next?
🤖Software Engineering with AI
Knowledge Elicitation with Large Language Models for Interpretable Cancer Stage Identification from Pathology Reports
arxiv.org·7h
🤖Software Engineering with AI
Detailed Technical Documentation on AI Implementation Logic (Taking Large Language Models as an Example)
🤖Software Engineering with AI
An open dataset of Chinese duration expressions
nature.com·22h
🧬Computational Neuroscience
Post-training methods for language models
developers.redhat.com·5h
🤖Software Engineering with AI
Reversal Invariance in Autoregressive Language Models
arxiv.org·7h
🧬Computational Neuroscience
[P] triplet-extract: GPU-accelerated triplet extraction via Stanford OpenIE in pure Python
🧬Computational Neuroscience
Prompt Injection as an Emerging Threat: Evaluating the Resilience of Large Language Models
arxiv.org·7h
🤖Software Engineering with AI
Automating error analysis for AI agents – what works and doesn't
🤖Software Engineering with AI
Explore More, Learn Better: Parallel MLLM Embeddings under Mutual Information Minimization
arxiv.org·7h
🧬Computational Neuroscience
ParaScopes: What do Language Models Activations Encode About Future Text?
arxiv.org·7h
🤖Software Engineering with AI
DEER: Disentangled Mixture of Experts with Instance-Adaptive Routing for Generalizable Machine-Generated Text Detection
arxiv.org·7h
🤖Software Engineering with AI
Benchmarking Large Language Models and Privacy Protection
priv.gc.ca·1d
🤖Software Engineering with AI