Claudio Greco

Claudio Greco

Machine Learning Scientist & Engineer

Machine Learning Scientist & Engineer

claudiogaetanogreco@gmail.com

claudiogaetanogreco@gmail.com

About me

About me

Hello, I'm Claudio Greco, a Member of Technical Staff at Reka AI working on large multimodal foundation models and multimodal agents.

In the past, I've been a Research Engineer at Alana AI, where I worked on building multimodal foundation models, and a Research Fellow at Heriot-Watt University, where I worked on designing trustworthy conversational agents.

Before that, I obtained a PhD in Cognitive and Brain Sciences at the Center for Mind/Brain Sciences of the University of Trento working in the Language and Vision Research Group under the supervision of Raffaella Bernardi. Before enrolling in my Ph.D program, I obtained B.Sc. and M.Sc. degrees in Computer Science at the University of Bari, where I worked on social network analysis and (conversational) recommender systems, respectively.

Experience

Experience

Sunnyvale, California (Remote)

Reka AI

September 2024 – Present

Member of Technical Staff

Working on large multimodal foundation models and multimodal agents.

Sunnyvale, California (Remote)

Reka AI

September 2024 – Present

Member of Technical Staff

Working on large multimodal foundation models and multimodal agents.

Edinburgh, United Kingdom

Alana AI

March 2023 – August 2024

Research Engineer

  • Designed, developed and trained from scratch a Multimodal (Vision & Language) Large Foundation Model, scaling the training process up to 7 billion parameters.

  • Built an instruction-tuning dataset for Egocentric Video Understanding.

  • Fine-tuned an open-source Multimodal Large Foundation Model on the Egocentric Video Understanding dataset using LoRa, showcasing competitive results compared to Gemini Pro 1.5 and GPT-4V, even surpassing the latter in spatial reasoning, and outperforming open-source models.

Edinburgh, United Kingdom

Alana AI

March 2023 – August 2024

Research Engineer

  • Designed, developed and trained from scratch a Multimodal (Vision & Language) Large Foundation Model, scaling the training process up to 7 billion parameters.

  • Built an instruction-tuning dataset for Egocentric Video Understanding.

  • Fine-tuned an open-source Multimodal Large Foundation Model on the Egocentric Video Understanding dataset using LoRa, showcasing competitive results compared to Gemini Pro 1.5 and GPT-4V, even surpassing the latter in spatial reasoning, and outperforming open-source models.

Edinburgh, United Kingdom

Heriot-Watt University

February 2022 – January 2023

Research Fellow in Conversational Systems

Worked on a project which involved building trustworthy conversational agents able to adapt during conversations to align with human values in the UKRI TAS Node on Trust. During the project, collected a dataset of trust annotations on Amazon Mechanical Turk.

Edinburgh, United Kingdom

Heriot-Watt University

February 2022 – January 2023

Research Fellow in Conversational Systems

Worked on a project which involved building trustworthy conversational agents able to adapt during conversations to align with human values in the UKRI TAS Node on Trust. During the project, collected a dataset of trust annotations on Amazon Mechanical Turk.

Berlin, Germany

SAP

June 2019 – November 2019

Research Internship

Worked on Continual Learning and Multimodal Learning.

Berlin, Germany

SAP

June 2019 – November 2019

Research Internship

Worked on Continual Learning and Multimodal Learning.

Education

Education

Rovereto, Italy

2017 - 2022

PhD degree in Cognitive and Brain Sciences

University of Trento

Thesis title: Transfer Learning and Attention Mechanisms in a Multimodal Setting. Thesis topics: Continual / Transfer Learning, Transformers, and Multimodal Learning. Supervisor: Raffaella Bernardi. Oversight committee: Raffaella Bernardi, Marco Baroni, and Raquel Fernández.

Rovereto, Italy

2017 - 2022

PhD degree in Cognitive and Brain Sciences

University of Trento

Thesis title: Transfer Learning and Attention Mechanisms in a Multimodal Setting. Thesis topics: Continual / Transfer Learning, Transformers, and Multimodal Learning. Supervisor: Raffaella Bernardi. Oversight committee: Raffaella Bernardi, Marco Baroni, and Raquel Fernández.

Bari, Italy

2014 - 2017

Master's degree in Computer Science

University of Bari

Grade: 110/110 cum laude and special commendation by the commission. Thesis on building a conversational content-based recommender system based on hierarchical deep reinforcement learning techniques. Supervisors: Pierpaolo Basile and Giovanni Semeraro.

Bari, Italy

2014 - 2017

Master's degree in Computer Science

University of Bari

Grade: 110/110 cum laude and special commendation by the commission. Thesis on building a conversational content-based recommender system based on hierarchical deep reinforcement learning techniques. Supervisors: Pierpaolo Basile and Giovanni Semeraro.

Bari, Italy

2008 - 2013

Bachelor's degree in Computer Science

University of Bari

Grade: 110/110 cum laude. Thesis on discovering and tracking organizational structures built from event logs over time exploiting social network analysis techniques. Supervisors: Annalisa Appice and Donato Malerba.

Bari, Italy

2008 - 2013

Bachelor's degree in Computer Science

University of Bari

Grade: 110/110 cum laude. Thesis on discovering and tracking organizational structures built from event logs over time exploiting social network analysis techniques. Supervisors: Annalisa Appice and Donato Malerba.