You need to agree to share your contact information to access this dataset

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this dataset content.

Korean Visual Instruct 150K Dataset Card

๐ŸŒ‹LLaVA์˜ Instruction-following Dataset์„ ํ•œ๊ตญ์–ด๋กœ ๋ฒˆ์—ญํ•œ ๋ฐ์ดํ„ฐ์…‹์ž…๋‹ˆ๋‹ค. (feat. DeepL)

1. Conversation

  • ์ด๋ฏธ์ง€์— ๋Œ€ํ•ด ์งˆ๋ฌธํ•˜๋Š” ์‚ฌ๋žŒ๊ณผ ์ด์— ๋‹ตํ•˜๋Š” Assistant ์‚ฌ์ด์˜ ๋Œ€ํ™” ํ˜•์‹์œผ๋กœ ๋””์ž์ธํ•ฉ๋‹ˆ๋‹ค. ๋Œ€๋‹ต์€ Assistant๊ฐ€ ์ด๋ฏธ์ง€๋ฅผ ๋ณด๊ณ  ์งˆ๋ฌธ์— ๋Œ€๋‹ตํ•˜๋Š” ๊ฒƒ๊ณผ ๊ฐ™์€ ์–ด์กฐ์ด๋ฉฐ, ์ด๋ฏธ์ง€์˜ ์‹œ๊ฐ์ ์ธ ์ •๋ณด(๊ฐ์ฒด์˜ ์œ ํ˜•, ์ˆ˜, ํ–‰๋™, ์œ„์น˜, ๊ฐ์ฒด๊ฐ„์˜ ์ƒ๋Œ€์ ์ธ ์œ„์น˜ ๋“ฑ)์— ๋Œ€ํ•ด ๋‹ค์–‘ํ•œ ์งˆ๋ฌธ์„ ํ•ฉ๋‹ˆ๋‹ค. ๋˜ํ•œ ๋ช…ํ™•ํ•˜๊ฒŒ ๋‹ต๋ณ€์ด ์žˆ๋Š” ์งˆ๋ฌธ๋งŒ ๊ณ ๋ คํ•ฉ๋‹ˆ๋‹ค.

2. Detailed description

  • ์ด๋ฏธ์ง€์— ๋Œ€ํ•œ ํ’๋ถ€ํ•˜๊ณ  ํฌ๊ด„์ ์ธ ์„ค๋ช…์„ ๋‚ดํฌํ•˜๊ฒŒ ๋””์ž์ธ ํ–ˆ์Šต๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ์ž์„ธํ•œ ์„ค๋ช…์„ ์š”๊ตฌํ•˜๋Š” ์—ฌ๋Ÿฌ promt ๋ฆฌ์ŠคํŠธ๋ฅผ ๋งŒ๋“  ๋’ค ๊ทธ์ค‘ ํ•˜๋‚˜๋ฅผ ์ƒ˜ํ”Œํ•ด ๋‹ต์„ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.

3. Complex reasoning

  • ์œ„์˜ ๋‘ ๊ฐ€์ง€ ์œ ํ˜•์€ ์‹œ๊ฐ์  content ์ž์ฒด์— ์ค‘์ ์„ ๋‘๋Š”๋ฐ์š”. Complex reasoning์—์„œ๋Š” ์ด๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ์‹ฌ์ธต ์ถ”๋ก  ์งˆ๋ฌธ์„ ์ถ”๊ฐ€๋กœ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค. ๋‹ต๋ณ€์€ ํƒ€๋‹นํ•œ ๋…ผ๋ฆฌ๋ฅผ ๊ฐ–์ถ˜ ๋‹จ๊ณ„๋ณ„ ์ถ”๋ก  ํ”„๋กœ์„ธ์Šค๋ฅผ ์š”๊ตฌํ•ฉ๋‹ˆ๋‹ค.

Done

  • Detail_23k
  • Conversation_58k
  • Complex_resoning_77k
  • ko_llava_instruct_150k

Project Repo

License

  • Attribution-NonCommercial 4.0 International | OpenAI policy ์ค€์ˆ˜
Downloads last month
1

Models trained or fine-tuned on tabtoyou/KoLLaVA-Instruct-150k