This page list the tools contained by the AI Toolbox.
| Tool name | Short description | Integration |
|---|---|---|
| LLM Tool | Large Language Models as deployable services from the HuggingFace model hub | deep |
| Object detection tool | Object detection tool based on Ultralytics models | deep |
| Demo Tool | Example tool for presenting deep integration int the AI Toolbox | deep |
| Prompt Context Injection Tool | Injecting context into NLP prompts | deep |
| 6DoF Pose Estimation Tool | 6 Degrees-of-Freedom pose estimation for robotic applications | shallow |
| Object detection with YoloV8 | Real-time object detection using YOLOv8 architecture | shallow |
| Image segmentation detection with BiRefNet | High-accuracy image segmentation using BiRefNet backbone | shallow |
| Optical Character Recognition with TrOCR | Text recognition from images using Transformer-based OCR | shallow |
| Large Language Model using DeepSeek-R1-0528-Qwen3-8B | Deployment-ready DeepSeek-R1-0528-Qwen3-8B LLM for advanced language tasks | shallow |
| Zero Shot Object Detection with OWLv2 | Object detection without prior training using OWLv2 models | shallow |
| Depth-based Isolation Forest Feature Importance | Interpretable Anomaly Detection with DIFFI | shallow |
| Extended Isolation Forest Feature Importance | Interpretable Anomaly Detection with ExIFFI | shallow |
| Mobile Segment Anything | Mobile Segment Anything | shallow |
| Segment Anything Model 3 | SAM 3 is an AI model that segments and tracks every instance of a concept across images and videos using text or image prompts. | shallow |
| Real-Time Detection Transformer (RT-DETR) | RT-DETR is a cutting-edge end-to-end object detector that provides real-time performance while maintaining high accuracy | shallow |
| Foundation Pose | Unified 6D Pose Estimation and Tracking of Novel Objects | shallow |
| DeepSeek-OCR | Contexts Optical Compression | shallow |
| Qwen3.5 | Qwen3.5 is a high-efficiency multimodal foundation model featuring hybrid architecture and advanced reasoning across 201 languages. | shallow |
| GPT-OSS 120B | OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. | shallow |
| Tabular Foundation Models | Foundation Model for Tabular Data | shallow |
| Lag-Llama | Open-source foundation model for time series forecasting | shallow |
| LayoutLMv3 | A unified Document AI model using ViT-style patches and joint masking to achieve state-of-the-art document understanding. | shallow |
| Embedding Tool | Text embedding generation for semantic search, clustering, retrieval, and similarity tasks | shallow |
| Reranker Tool | Document reranking for improving retrieval precision in search and RAG pipelines | shallow |
| Speech-to-Text Tool | Automatic speech recognition and speech translation for multilingual audio transcription | shallow |
| Text-to-Speech Tool | Multilingual text-to-speech and voice cloning for natural speech generation | shallow |
| Document Parsing Tool | Structured document conversion and PDF understanding with layout, table, and content extraction | shallow |
| Image Classification Tool | Image classification for assigning category labels to images without object localization | shallow |
| Multi-Object Tracking Tool | Real-time multi-object tracking across video frames for surveillance, robotics, and analytics | shallow |
| Human Pose Estimation Tool | Human keypoint and skeletal pose estimation for movement analysis and interaction understanding | shallow |
| Multimodal VLM Tool | Vision-language reasoning over images and text for captioning, visual question answering, and multimodal assistance | shallow |
| Time-Series Anomaly Detection Tool | Detection of unusual temporal patterns and anomalies in sensor, finance, and operational time series | shallow |
| Tabular Prediction Tool | Classification and regression on structured tabular datasets for practical machine learning applications | shallow |
| Synthetic Data Generation Tool | Generation of synthetic datasets for privacy-preserving training, augmentation, and simulation workflows | shallow |
| Explainability Tool | Model interpretability and explanation for tabular, vision, and deep learning systems | shallow |