Dataset Card for Genshin Voice

Dataset Description

Dataset Summary

The Genshin Voice dataset is a text-to-voice dataset of voice lines from different Genshin Impact characters, unpacked from the game.

Languages

The text in the dataset is in Mandarin.

Dataset Creation

Source Data

Initial Data Collection and Normalization

The data was obtained by unpacking the Genshin Impact game.

Who are the source language producers?

The language producers are employees of Hoyoverse and contractors from EchoSky Studio.

Annotations

The dataset contains official annotations from the game, including the in-game speaker name and the transcript for each voice line.
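
As a rough illustration of how these two annotations might be used together, the sketch below groups transcripts by speaker. The record type `VoiceLine`, its field names `speaker` and `transcript`, and the sample values are all hypothetical: this card does not state the actual column names, so adjust them to match the real data.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class VoiceLine:
    # Hypothetical field names; the real column names are not given in this card.
    speaker: str      # in-game speaker name
    transcript: str   # Mandarin transcript of the voice line


def group_by_speaker(lines):
    """Return a dict mapping each speaker name to a list of their transcripts."""
    grouped = defaultdict(list)
    for line in lines:
        grouped[line.speaker].append(line.transcript)
    return dict(grouped)


# Placeholder records for illustration only, not taken from the dataset.
sample = [
    VoiceLine("speaker_a", "示例文本一"),
    VoiceLine("speaker_b", "示例文本二"),
    VoiceLine("speaker_a", "示例文本三"),
]

by_speaker = group_by_speaker(sample)
for name, transcripts in by_speaker.items():
    print(name, len(transcripts))
```

Grouping by speaker like this is a common first step when preparing per-character subsets for voice training, since each character's lines are usually handled separately.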

Additional Information

Dataset Curators

The dataset was initially created by w4123 in his GitHub repository.

Licensing Information

Copyright © COGNOSPHERE. All Rights Reserved.
