Research Projects
Our research work across inference optimization, computer vision, and quantitative finance.
Completed Research
PyTorch native INT8 quantization API
PublishedINT8 tensor subclass for PyTorch enabling up to 4× memory reduction with optimized CUDA/Triton kernels. Merged into TorchAO.
LLM Serving Frameworks Comparison
Coming SoonBenchmarking vLLM, SGLang, and HuggingFace TGI across agentic workflows, long-context, and high-throughput scenarios.
Current Research
nano-vLLM
In ProgressMinimalist implementation of vLLM for educational purposes and rapid prototyping of inference optimizations.
ViT Robustness via Occlusion
In ProgressImproving Visual Transformer robustness using occlusion generators for better generalization.
Sparse Frame Selector
In ProgressBuilding sparse frame selection methods for faster video reasoning and efficient temporal understanding.
Prediction Market Analysis
In ProgressUsing LLMs for prediction market analysis and probabilistic forecasting.
Event-Based Trading Agents
In ProgressDeveloping LLM-powered trading agents that react to market events and news.
Interested in collaborating on research? Contact us at daniel@aerlabs.tech or shubham@aerlabs.tech