Intern: RS - Foundation Models for Data Management & Lakehouses
IBM Research Scientists are charting the future of Artificial Intelligence, creating breakthroughs in quantum computing, discovering how blockchain will reshape the enterprise, and much more. Join a team that is dedicated to applying science to some of today's most complex challenges, whether it’s discovering a new way for doctors to help patients, teaming with environmentalists to clean up our waterways or enabling retailers to personalize customer service.
Your Role and Responsibilities
This is for a 2024 summer internship with the following start dates: May - August or June - September for quarter system schools.
We are broadly interested in making foundation models (FMs) effective for a range of data management tasks, particularly those related to the management of structured data in enterprise data lakes and lakehouses.
Topics of interest include research on effective and efficient tuning techniques, knowledge-driven reasoning, and causality-driven alignment for better control and run-time performance of FMs and their use in enterprise data tasks. Tasks of interest include semantic enrichment of structured data, semantic data management with metadata and knowledge graphs, code generation for data retrieval with transformations, and various data wangling tasks in the end-to-end data lifecycle in data lakes.
Tuning-related research spans full-space and parameter-efficient tuning techniques with supervised as well as reinforcement learning with reward functions that capture end-use performance. Grounding the generation of tuned models in domain-specific vocabulary, efficient techniques for human-in-the-loop adaptation at inference time, and retrieval augmentation techniques for data management tasks will be of interest.
For knowledge-driven reasoning, formulations and benchmarks that treat the database query-answering process as a knowledge-extraction task will be useful for experimenting with reasoning over database tables at different levels of complexity to improve and expand the reasoning skills of FMs.
For causal alignment, we're interested in formulations that study and show the causal relationships behind the effectiveness of different prompt optimization methods, where a small set of prompt augmentation tokens improves FMs for issues like delusions, alignment, and transfer.
Required Technical and Professional Expertise
- Applicants should be PhD & MS students pursuing graduate studies in computer science and related fields
- Having at least one research publication, preferrably at a top conference in AI or data management
- Familiarity with the basics of data management and data lakes
- Familiarity and working expertise with large language models (LLMs) or other Foundation Models
Preferred Technical and Professional Expertise
Candidates should have basic knowledge in one or more of the following skills:
- Familiarity with ontologies, knowledge graphs, and description logic
- Familiarity with reinforcement learning, causal graphical models, and prompt optimization
About Business Unit
Your Life @ IBM
Being an IBMer means you’ll be able to learn and develop yourself and your career, you’ll be encouraged to be courageous and experiment everyday, all whilst having continuous trust and support in an environment where everyone can thrive whatever their personal or professional background.
Our IBMers are growth minded, always staying curious, open to feedback and learning new information and skills to constantly transform themselves and our company. They are trusted to provide on-going feedback to help other IBMers grow, as well as collaborate with colleagues keeping in mind a team focused approach to include different perspectives to drive exceptional outcomes for our customers. The courage our IBMers have to make critical decisions everyday is essential to IBM becoming the catalyst for progress, always embracing challenges with resources they have to hand, a can-do attitude and always striving for an outcome focused approach within everything that they do.
Are you ready to be an IBMer?
The compensation range and benefits for this position are based on a full-time schedule for a full calendar year. The salary will vary depending on your job-related skills, experience and location. Pay increment and frequency of pay will be in accordance with employment classification and applicable laws. For part time roles, your compensation and benefits will be adjusted to reflect your hours. Benefits may be pro-rated for those who start working during the calendar year.
We consider qualified applicants with criminal histories, consistent with applicable law.
Being You @ IBM