Speech Research Scientist (Prosody Modeling)

Company: Oben Bouwmachines
Location: Pasadena, California, United States
Type: Full-time
Posted: 09.OCT.2019



ObEN's mission is to enable everyone in the world to create their own Personal AI (PAI), intelligent 3D avatars that look, sound, and behave like the individual user. Secured and authenticated on the Project PAI blockchain, ObEN's technology creates more productive, more personalized digital interactions. ObEN is a K11, Tencent, Softbank Ventures Korea and HTC Vive X portfolio company, and we work with our strategic investors to expand PAI technology across multiple verticals including hospitality, retail, healthcare, and entertainment.

Working at ObEN means taking on extraordinary transformations every day, in an environment that celebrates and encourages innovation. You'll be working in small, agile teams that include world-class researchers in speech, computer vision, machine learning, NLP, and blockchain. We are blazing new trails in AI and blockchain technology, and we encourage and support publications at top conferences and in top journals. Learn more about working at ObEN in our blog post.

Job Description: Develop new prosody models for different languages (Chinese, English, Japanese, Korean) to improve the naturalness and similarity of the synthesized voice and to allow better control of its expressivity.

Responsibilities:
  • Develop new prosody models for different languages, adaptable using a small amount of data
  • Develop generic prosodic models for different kinds of expressivity that can be applied to any voice
  • Develop sentiment analysis algorithms to control expressivity from text input

Requirements:
  • PhD with strong experience in prosody modeling for speech synthesis, demonstrated by publications in top speech journals and conferences (Speech Prosody, ICASSP, Interspeech, etc.)
  • Strong implementation skills and general knowledge of ML
  • Fluent in Python and C++, with good knowledge of deep learning packages
  • Familiarity with linguistic phonetics
  • Knowledge of basic digital signal processing techniques for audio

Application requirements:
  • Detailed resume and/or LinkedIn profile
  • Links to any research / papers you have been an instrumental part of and are proud of
  • Name of instructor / adviser, if any, along with a link to their profile
  • Cover letter identifying your five favorite apps on your phone

Interview process:
  • STAGE 1: Phone interview
  • STAGE 2: In-person interview at Idealab (we cover travel expenses for the day)
  • STAGE 3: We require a sample project submission and a candidate proposal submission (to learn more about what an ObEN candidate proposal is, click here)
  • STAGE 4: Spend a day at our office and participate in all team activities
  • STAGE 5: Offer letter

