
Meet with Deci
at GTC 2024

March 17-21, 2024
San Jose, California

We’re exhibiting at the 2024 GTC AI conference.

Whether you are looking to achieve real-time performance, reduce model size, or increase throughput, Deci's NAS-based model optimization can help you deliver seamless inference in any environment. 

Book a meeting with Deci's experts to learn how the Automated Neural Architecture Construction (AutoNAC) engine can empower your team to build, optimize, and deploy highly accurate and efficient models for your deep learning and generative AI applications.

See you at booth #1501!


Catch Our Session on Gen AI Cost-Efficiency

Tuesday, Mar 19 | 4:30 PM - 4:55 PM PDT

Can High Performance Also Be Cost-Efficient When It Comes to Generative AI?

Large language models present a unique set of challenges when deployed in production environments. Their extensive size and complexity result in high operational costs and reduced inference speed, especially when scaling up. Traditional optimization techniques, tailor-made for smaller models, fall short at this scale.

This talk illuminates the path toward efficient model design, with a spotlight on DeciCoder, an open-source code generation LLM. DeciCoder was designed with Neural Architecture Search to maximize performance on the NVIDIA A10 GPU. We'll explore the methodologies behind DeciCoder's design and training phases and demonstrate the transformative potential of neural architecture search for building resource-efficient generative LLMs. Gain valuable insights to enhance your research, development, and deployment strategies.

Can you truly have your high-performance GenAI and deploy it cost-effectively, too? Join us to find out.

 


Yonatan Geifman

CEO & Co-Founder, Deci

Yonatan Geifman is the CEO and Co-Founder of Deci. Before co-founding Deci, Yonatan was a member of Google AI’s MorphNet team. He holds a PhD in Computer Science from the Technion-Israel Institute of Technology and a B.Sc. and M.Sc. in Mathematics and Computer Science from Ben-Gurion University in Israel.

His research focused on making Deep Neural Networks (DNNs) more applicable to mission-critical tasks. It has been published and presented at leading global conferences, including the Conference on Neural Information Processing Systems (NeurIPS) and the International Conference on Machine Learning (ICML).

Don't Miss the Excitement at Booth #1501!

Explore Our Generative AI Models and Inference SDK

Join a live prompting challenge using our DeciLM + RAG demo.

Get to know more about DeciCoder, DeciDiffusion, DeciLM, and Infery LLM, and why you'll love them:

  • Efficient Model Design: Our generative models deliver high throughput and accuracy with low memory footprint and latency.
  • Supercharged Performance: Infery LLM ensures your AI runs smoother and faster, maximizing efficiency and dramatically reducing your cloud costs.
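
For a sense of how approachable DeciCoder and DeciLM are to try, here is a minimal, hypothetical loading sketch using the Hugging Face transformers library. The model ID, dtype, and generation settings are assumptions; check each model card on the Hugging Face Hub for exact usage.

    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumed Hub ID; Deci/DeciLM-7B can be loaded the same way.
    model_id = "Deci/DeciCoder-1b"

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,
        trust_remote_code=True,  # Deci models ship custom architecture code on the Hub
    )

    # Generate a short code completion from a prompt.
    prompt = "def fibonacci(n):"
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=64)
    print(tokenizer.decode(outputs[0], skip_special_tokens=True))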

 

 

See Deci's Multimodal CV Model at the Edge

See live-streamed inference of a multimodal model on an NVIDIA Jetson Orin Nano, and learn how Deci's foundation models:

  • Run at unparalleled accuracy and speed, outperforming other well-known models.
  • Are fully compatible with high-performance inference engines like NVIDIA® TensorRT™ and support INT8 quantization.
  • Leverage cutting-edge techniques such as attention mechanisms, quantization-aware blocks, and reparameterization at inference time.
  • Are easy to fine-tune to SOTA results in a Google Colab notebook with the SuperGradients open-source library (see the sketch after this list).
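
As a rough illustration of that workflow, here is a minimal sketch using the open-source SuperGradients library. The model name, pretrained weights, and image path are assumptions; the official Colab notebooks walk through full fine-tuning on a custom dataset.

    from super_gradients.training import models

    # Load a pre-trained detection model from the SuperGradients model zoo;
    # the same model object is the starting point for fine-tuning with the Trainer.
    model = models.get("yolo_nas_s", pretrained_weights="coco")

    # Run inference on a local image and display the predictions.
    model.predict("example.jpg").show()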

 

 

Talk with Deci's Experts 1-on-1

Get a free consultation session and learn how other leading AI teams are collaborating with Deci to solve AI design and deployment challenges.


David Stein
Director of Strategic Partnerships
LinkedIn →


Elana Krasner
Senior Product Marketing Manager
LinkedIn →


Aviad Simon
VP Operations and GM, US
LinkedIn →


Assaf Katan
Chief Business Officer
LinkedIn →


Drew Allen
Sr. Enterprise Account Executive
LinkedIn →

 


 

AutoNAC (Neural Architecture Search Engine)

Automatically find the architecture that delivers the highest accuracy for your specific speed, size, and inference hardware targets

Tailored for Your Task & Performance Goals

Run a multi-objective search to generate an architecture optimized for several parameters (accuracy, throughput, latency, model size, and memory footprint)

Hardware-Aware Neural Architecture Search

Take hardware constraints into account to generate architectures that hit your performance targets with optimal efficiency

Outperforms Any SOTA Computer Vision Model

Deliver the best accuracy and speed for your specific use case, outperforming SOTA open-source neural networks

 

 

See You at Booth #1501!


Locate our booth at the MLOps & LLMOps Platforms pavilion!
