Research Projects

Our v0 batch research initiatives focused on advancing AI inference optimization, quantization techniques, and efficient serving infrastructure.

Our First Research Cohort

The v0 batch represents our inaugural research cohort, bringing together talented researchers to tackle fundamental challenges in AI systems. Our focus areas include quantization techniques for efficient inference, comparative analysis of serving frameworks, and optimization strategies for production deployment. Each project combines rigorous experimentation with practical implementation, contributing both to academic understanding and open-source tooling.

Research Areas

  • Quantization & Compression: INT8 quantization techniques, memory optimization, and hardware-aware model compression
  • Serving Infrastructure: Comparative analysis of vLLM, SGLang, HuggingFace TGI across diverse workload patterns
  • Production Optimization: Kernel development, profiling methodologies, and real-world deployment strategies

Active Projects

PyTorch native INT8 quantization API for TorchAO

Status: Active

A quantized tensor subclass enabling INT8 inference for neural networks through seamless PyTorch integration. Supports dynamic activation quantization (INT8×INT8) and weight-only quantization (FP16/BF16×INT8) with optimized kernels for CPU and CUDA, reducing weight memory footprint by up to 4× relative to FP32 while maintaining model accuracy. Custom CUDA/Triton kernel development and comprehensive benchmarking against Hugging Face and vLLM baselines are in progress.
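The core idea behind the weight-only path is to store weights as INT8 with a per-channel scale and dequantize them during the matmul. A minimal NumPy sketch of symmetric per-output-channel INT8 quantization, assuming hypothetical helper names (this is not TorchAO's actual API):

```python
import numpy as np

def quantize_int8_per_channel(w: np.ndarray):
    """Symmetric per-output-channel INT8 quantization of a weight matrix.

    Each row (output channel) gets its own scale so that its largest
    magnitude maps to 127. Storing INT8 instead of FP32 cuts weight
    memory by 4x.
    """
    scale = np.abs(w).max(axis=1, keepdims=True) / 127.0
    scale = np.where(scale == 0, 1.0, scale)  # avoid divide-by-zero
    w_q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return w_q, scale

def linear_int8_weight_only(x: np.ndarray, w_q: np.ndarray, scale: np.ndarray):
    """y = x @ W^T with INT8 weights dequantized on the fly."""
    return x @ (w_q.astype(np.float32) * scale).T

rng = np.random.default_rng(0)
w = rng.standard_normal((8, 16)).astype(np.float32)   # toy weight matrix
x = rng.standard_normal((4, 16)).astype(np.float32)   # toy activations

w_q, scale = quantize_int8_per_channel(w)
y_ref = x @ w.T                                       # full-precision reference
y_q = linear_int8_weight_only(x, w_q, scale)          # quantized path
print(np.abs(y_ref - y_q).max())                      # small quantization error
```

In the dynamic-activation (INT8×INT8) variant, activations are additionally quantized per batch at runtime so the inner matmul itself can run in integer arithmetic; the sketch above shows only the weight-only case.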

Tags: Quantization · INT8 · PyTorch · TorchAO · Inference Optimization

Comparative Analysis of LLM Serving Frameworks

Status: Active

A comprehensive benchmarking study comparing vLLM, SGLang, and HuggingFace TGI across diverse workload patterns, including agentic workflows, long-context processing, and high-throughput scenarios. The research investigates how architectural differences, such as SGLang's RadixAttention versus vLLM's PagedAttention, impact performance metrics (time to first token, TTFT; time per output token, TPOT; and throughput) under varying conditions.

Tags: Benchmarking · vLLM · SGLang · TGI · Serving Infrastructure
By: Hyoseop Song
Detailed Report Coming Soon

Interested in collaborating on research? Contact us at daniel@aerlabs.tech or shubham@aerlabs.tech