Research

Our team investigates a range of advanced machine learning problems primarily involving deep learning and reinforcement learning.

Who we are

The research team at DataRes is a group of highly motivated students, all with an interest in machine learning. We strive to explore novel ideas in this quickly developing field.

From large language models to geometric deep learning, we develop knowledge about and research cutting edge algorithms.

If any of this sounds interesting to you, take a look at past work, and consider applying!

Previous work

In the 2019-2020 academic year, we worked with reinforcement learning and supervised learning problems, which required our members to apply their software engineering, research and critical thinking skills.

The first problem, which you can read about here, was to teach an agent how to gamble in the game of Roulette.

Eventually, our members were able to learn from the agent, and understand they should in fact not gamble at all, as most betting positions are far from being statistically convenient for the gambler.

With the second problem, we took it a step further. We taught our members what a ResNet is, and how we can use it for regression, computer vision problems.

Specifically, we taught an AI how to predict speed from just video frames. This included a lot of preprocessing, one of the most fundamental steps in data science, which allowed our members to learn about the hassles of training a realistic model that generalizes well enough.

Subsequently, in 2022-2023, we focused on transformer models and graph neural networks.

With regards to transfomer models, we worked primarily with BERT, the language model that powers Google Search. Through fine tuning and modifications, our teams were able to use BERT to process time series data, which included using BERT’s power as an image transfomer (see BEiT), and also to predict oil prices over time using text-analysis of news articles.

In the graph neural network space, our teams utilized PyTorch Geometric, a powerful geometric deep learning framework based on PyTorch. Using these nascent ideas in deep learning, they were able to explore the possiblities of representation learning, drawing out information from complex, relational datasets, such as Wikipedia articles, and road networks.

Apply, to learn, to grow, to make a change.

We are excited to meet you, learn from you, and help you grow. If you have any questions, feel free to email Larry Lu

Cheers.