13. Generative AI
【R&D Engineer】
- Position Summary
-
We are an R&D organization dedicated to the research and development of large-scale generative AI technologies for content creation and production in the entertainment domain, including music, film, and games. Generative AI technologies have the potential to transform both consumer lifestyles and the workflows of professional creators, and are expected to become an essential component of the music, film, and gaming industries in the years ahead. By leveraging opportunities to collaborate directly with world-leading entertainment groups across these industries, our team engages in cutting-edge research and development to contribute to Sony Group’s businesses. For more information about our research activities and publications, please visit: https://sony.github.io/creativeai
- Responsibilities
-
In this role, you will work on the design and optimization of Agent-based workflows to enable high-quality video generation. Your responsibilities will include, for example:
(1) Designing and improving Agent workflows to generate natural and temporally consistent video sequences
(2) Leveraging control mechanisms and techniques such as Temporal Attention to ensure stable, high-quality visual outputs
(3) Incorporating self-correction logic into Agent chains to maintain a high level of consistency with input metadata
- Term
-
3 months to 1 year, with a minimum availability of 2 days per week (flexible and open to discussion)
- Required Degree
-
Applicants must be currently enrolled in a master’s or doctoral program.
- Required qualifications
-
- Technical Stack: Strong Python foundation; familiarity with running LLMs, diffusion models, and agent architectures (e.g., ComfyUI, LangGraph).
- Intuition & Aesthetics: Good engineering intuition for debugging generative "artifacts" and a keen eye for video rhythm and composition.
- Language: Professional proficiency in English (reading, writing, and speaking) is required for daily collaboration.
- Preferred qualifications
-
- Experience in time-series signal processing or audio-visual synchronization (AV-Sync).
- Active in the AIGC (AI-generated content) community, with contributions to GitHub projects or custom ComfyUI nodes.
- Product, Service
-
Content creation such as animation and videos
- Development Environment
-
Python, Linux
- Entry period
-
Open until positions are filled
- How to enter
-
Please register on the 2028 Graduate Internship & New Graduate Recruitment My Page.
After completing your registration, please select the【ソニーグループ株式会社 新卒採用】(Sony Group Corporation New Graduate Recruitment) banner.
Then fill out and submit the Profile Sheet for【長期有給インターン】(Long-Term Paid Internship).
- Treatment
-
PhD students: 2,200 JPY / hour
Bachelor's / Master's students: 2,000 JPY / hour
Compensation will be determined based on skills and experience.
- Location
- Sony City Osaki
- Company
- Sony Group Corporation
14. Multimodal NLP
【R&D Researcher】
- Position Summary
-
We are an R&D organization dedicated to the research and development of large-scale generative AI technologies for content creation and production in the entertainment domain, including music, film, and games. Generative AI technologies have the potential to transform both consumer lifestyles and the workflows of professional creators, and are expected to become an essential component of the music, film, and gaming industries in the years ahead. By leveraging opportunities to collaborate directly with world-leading entertainment groups across these industries, our team engages in cutting-edge research and development to contribute to Sony Group’s businesses. For more information about our research activities and publications, please visit: https://sony.github.io/creativeai
- Responsibilities
-
You will conduct fundamental research in natural language processing, in areas such as multimodal learning, multimodal LLMs, music/video understanding, agents, reasoning, controllable generative modeling, deep generative models for discrete data, image/audio captioning, text-to-image/audio generation, vision-language pre-training, commonsense knowledge graphs, and large-scale data development.
You will be responsible for a wide range of activities, including submitting papers to top conferences (e.g., ACL, EMNLP, NeurIPS, ICLR, CVPR), conducting collaborative research with universities and/or business groups, and working with product teams to deploy the developed technologies within Sony and/or in third-party products.
You will also contribute to improving the efficiency of content creation in Sony's studios for music, movies, and game services by delivering AI-assisted tools developed by R&D.
- Term
-
3 months to 1 year, with a minimum availability of 2 days per week (flexible and open to discussion)
- Required Degree
-
Applicants must be currently enrolled in a doctoral program.
- Required qualifications
-
All of the following criteria are required.
- Master's degree in natural language processing, artificial intelligence, machine learning, or closely related areas OR equivalent practical experience.
- 3 years of experience with Python and Linux/Unix.
- 2 years of experience in machine learning fields and NLP, using common frameworks such as PyTorch and TensorFlow.
- Research ability, as demonstrated by a track record of conference papers, open-source software, or other scientific activities.
- Ability to speak and write in English fluently and idiomatically.
- Preferred qualifications
-
A qualification at the Ph.D. level or higher in natural language processing, artificial intelligence, or machine learning is desirable.
- Product, Service
-
Content creation support for movies/music/games
- Development Environment
-
Python, Linux
- Entry period
-
Open until positions are filled
- How to enter
-
Please register on the 2028 Graduate Internship & New Graduate Recruitment My Page.
After completing your registration, please select the【ソニーグループ株式会社 新卒採用】(Sony Group Corporation New Graduate Recruitment) banner.
Then fill out and submit the Profile Sheet for【長期有給インターン】(Long-Term Paid Internship).
- Treatment
-
PhD students: 2,200 JPY / hour
Bachelor's / Master's students: 2,000 JPY / hour
Compensation will be determined based on skills and experience.
- Location
- Sony City Osaki
- Company
- Sony Group Corporation