Second-Order Methods for ML

Enhancing ML models with second-order information

Common approaches to training ML models come with some inherent disadvantages because they rely on mathematical methods involving only first-order information (direction of change), but not second-order information (rate of change). This project, titled “Scalable Second-order Methods for Training, Designing, & Deploying Machine Learning Models” aims to leverage second-order methods to improve ML optimization.

Efficient optimization algorithms are essential to enabling many applications of machine learning. However, optimization methods that use only first derivative information can suffer from slow convergence, poor communication, and the need for laborious hyper-parameter tuning. While second-order methods could mitigate many of these disadvantages, they are far less used within the ML community.

In this project, we are advancing the innovative application of second-order information to develop, implement, and apply novel methods to enhance the design, diagnostics, and training of ML models. To accomplish this, we are tackling challenges involved in training large-scale nonconvex ML models from four general angles: high-quality local minima; distributed computing environments; generalization performance; and acceleration. We also aim to develop efficient Hessian-based diagnostics tools for analyzing the training process as well as already-trained models.

To study improvements and applications for our proposed methods, we are developing implementations for both shared-memory and distributed computing environments in the context of improved communication properties; exploiting adversarial data; and the improvement of neural architecture design and search.

Project Team

group

Associated ICSI Group

Research Groups

AI and Big Data Group

We develop innovative methods to extract insights from data. Our work accelerates scientific discovery and underpins transformative AI applications.

account_circle

ICSI Research Team

Michael Mahoney

View Bio

Michael W. Mahoney, PhD, is Vice President, Principal Scientist, and Group Lead for the AI and Big Data group at ICSI.

Amir Gholaminejad

View Bio

Amir Gholaminejad (Gholami), PhD, is a Research Affiliate at ICSI and an Associate Research Scientist at the Berkeley Artificial Intelligence Research and Sky Computing Labs at UC Berkeley.

About

attach_money

Focus Areas

Machine Learning (Supervised, Unsupervised, Reinforcement Learning)

mail

Get in touch

Want to discuss opportunities to work with ICSI? We’d love to hear from you.

Contact Us

location_on

2150 Shattuck Ave., #250
Berkeley, CA 94704

phone

+1 (510) 666-2900

mail

contact @ icsi.berkeley.edu

person

Join Our Team

We’re always looking for researchers, visitors, and collaborators. Explore all the benefits of joining our inclusive, interdisciplinary, impacts-oriented team.

Learn More

dvr

Use Our Services

Apply ICSI brainpower to your organization’s most pressing challenges with our technical and strategy consulting services. Or, use our modern office space for workshops and meetings.

Learn More

Associated ICSI Group

Research Groups

AI and Big Data Group

ICSI Research Team

Michael Mahoney

Amir Gholaminejad

Sponsors

Focus Areas

Research Themes

Machine Learning and Artificial Intelligence

Research Themes

AI for Science and Health

Projects

AI Algorithms for Science at the Edge

Projects

AI for High-Energy Physics

Projects

DNN Attack Detection

Projects

Training Secure and Robust DNNs

Get in touch

Join Our Team

Use Our Services

Who We Are

Our Research

Our Impact

Work With Us

Resources

News

|

Projects

Jump Down:

Enhancing ML models with second-order information

Project Team

Associated ICSI Group

Research Groups

AI and Big Data Group

ICSI Research Team

Michael Mahoney

Amir Gholaminejad

About

Sponsors

Focus Areas

Related

Research Themes

Machine Learning and Artificial Intelligence

Research Themes

AI for Science and Health

Projects

AI Algorithms for Science at the Edge

Projects

AI for High-Energy Physics

Projects

DNN Attack Detection

Projects

Training Secure and Robust DNNs

Get in touch

Join Our Team

Use Our Services

Who We Are

Our Research

Our Impact

Work With Us

Resources

News