Nov. 4, 2024
|
ShadowKV: A High-Throughput Inference System for Long-Context LLM Inference
|
MarkTechPost | AI |
|
EDLM: A New Energy-based Language Model Embedded with Diffusion Framework
|
MarkTechPost | AI |
|
Nanoscale transistors could enable more efficient electronics
|
MIT News - Machine learning |
|
Forthcoming machine learning and AI seminars: November 2024 edition
|
ΑΙhub |
|
New Research Finds Sixteen Major Problems With RAG Systems, Including Perplexity
Mars
|
Unite.AI |
|
Natural Language Generation Inside Out: Teaching Machines to Write Like Humans
|
Machine Learning Mastery |
|
pEBR: A Novel Probabilistic Embedding based Retrieval Model to Address the Challenges of Insufficient Retrieval for Head Queries and Irrelevant Retrieval for Tail Queries
Mars
|
MarkTechPost | AI |
|
AI Utilities: Top 15 Use cases & case studies
SpaceX
|
AIMultiple |
|
Top 25 AI Assistants in 2025
Mars
|
MarkTechPost | AI |
|
SMART Filtering: Enhancing Benchmark Quality and Efficiency for NLP Model Evaluation
Mars
|
MarkTechPost | AI |
|
It’s The End Of The Legal Industry As We Know It
Mars
|
Artificial Lawyer |
|
SearchGPT: ChatGPT’s New Web Search Tool
|
Robot Writers AI |
|
Meet Hertz-Dev: An Open-Source 8.5B Audio Model for Real-Time Conversational AI with 80ms Theoretical and 120ms Real-World Latency on a Single RTX 4090
|
MarkTechPost | AI |
|
LLaMA-Berry: Elevating AI Mathematical Reasoning through a Synergistic Approach of Monte Carlo Tree Search and Enhanced Solution Evaluation Models
|
MarkTechPost | AI |
|
Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP
|
Apple ML Research |
|
Nov. 3, 2024
|
Top 15 UEBA Use Cases For Today’s SOCs
Mars
|
AIMultiple |
|
Future Token Prediction Model FTP: A New AI Training Method for Transformers that Predicts Multiple Future Tokens
|
MarkTechPost | AI |
|
Efficient Function Calling in Small-Scale LLMs: A Game-Changer for AI Reasoning Tasks
|
MarkTechPost | AI |
|
Tokenformer: The Next Generation of Transformer Architecture Leveraging Tokenized Parameters for Seamless, Cost-Effective Scaling Across AI Applications
|
MarkTechPost | AI |
|
Understanding Memorization in Diffusion Models: A Statistical Physics Approach to Manifold-Supported Data
|
MarkTechPost | AI |
|
Trajectory Flow Matching (TFM): A Simulation-Free Training Algorithm for Neural Differential Equation Models
|
MarkTechPost | AI |
|
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
|
MarkTechPost | AI |
|
This AI Paper from Google Research Introduces Speculative Knowledge Distillation: A Novel AI Approach to Bridging the Gap Between Teacher and Student Models
|
MarkTechPost | AI |
|
Meta AI Releases Sparsh: The First General-Purpose Encoder for Vision-Based Tactile Sensing
|
MarkTechPost | AI |
|
Decoding Arithmetic Reasoning in LLMs: The Role of Heuristic Circuits over Generalized Algorithms
NASA
|
MarkTechPost | AI |
|
Nov. 2, 2024
|
Leopard: A Multimodal Large Language Model (MLLM) Designed Specifically for Handling Vision-Language Tasks Involving Multiple Text-Rich Images
|
MarkTechPost | AI |
|
Cornell Researchers Introduce QTIP: A Weight-Only Post-Training Quantization Algorithm that Achieves State-of-the-Art Results through the Use of Trellis-Coded Quantization (TCQ)
|
MarkTechPost | AI |
|
Anthropic Launches Visual PDF Analysis in Latest Claude AI Update
|
Unite.AI |
|
Multi-Scale Geometric Analysis of Language Model Features: From Atomic Patterns to Galaxy Structures
|
MarkTechPost | AI |
|
Researchers at KAUST Use Anderson Exploitation to Maximize GPU Efficiency with Greater Model Accuracy and Generalizability
NASA
|
MarkTechPost | AI |
|
|