SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling D Kim, C Park, S Kim, W Lee, W Song, Y Kim, H Kim, Y Kim, H Lee, J Kim, ... arXiv preprint arXiv:2312.15166, 2023 | 118* | 2023 |
sDPO: Don't Use Your Data All at Once D Kim, Y Kim, W Song, H Kim, Y Kim, S Kim, C Park arXiv preprint arXiv:2403.19270, 2024 | 18 | 2024 |
AiRS: A Large-Scale Recommender System at NAVER News H Lim, YC Lee, JS Lee, S Han, S Kim, Y Jeong, C Kim, J Kim, S Han, ... 2022 IEEE 38th International Conference on Data Engineering (ICDE), 3386-3398, 2022 | 14 | 2022 |
Open Ko-LLM Leaderboard: Evaluating Large Language Models in Korean with Ko-H5 Benchmark C Park, H Kim, D Kim, S Cho, S Kim, S Lee, Y Kim, H Lee arXiv preprint arXiv:2405.20574, 2024 | 11* | 2024 |
Is it enough just looking at the title? Leveraging body text to enrich title words towards accurate news recommendation T Kim, Y Kim, YC Lee, WY Shin, SW Kim Proceedings of the 31st ACM International Conference on Information …, 2022 | 9 | 2022 |
MONET: Modality-Embracing Graph Convolutional Network and Target-Aware Attention for Multimedia Recommendation Y Kim, T Kim, WY Shin, SW Kim Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024 | 7 | 2024 |
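Research Trends in Hyperscale Language Models 박찬준, 이원성, 김윤기, 김지후, 이활석 Communications of the Korean Institute of Information Scientists and Engineers 41 (11), 8-24, 2023 | 7 | 2023 |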
Evalverse: Unified and Accessible Library for Large Language Model Evaluation J Kim, W Song, D Kim, Y Kim, Y Kim, C Park arXiv preprint arXiv:2404.00943, 2024 | 5 | 2024 |
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language Models H Park, S Lee, G Gim, Y Kim, D Kim, C Park arXiv preprint arXiv:2403.19340, 2024 | 2 | 2024 |
LP Data Pipeline: Lightweight, Purpose-driven Data Pipeline for Large Language Models Y Kim, H Ha, S Yang, S Lee, J Kim, C Park arXiv preprint arXiv:2411.11289, 2024 | | 2024 |
Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs H Kim, D Kim, J Kim, S Lee, Y Kim, C Park arXiv preprint arXiv:2410.12445, 2024 | | 2024 |
Representing the Under-Represented: Cultural and Core Capability Benchmarks for Developing Thai Large Language Models D Kim, S Lee, Y Kim, A Rutherford, C Park arXiv preprint arXiv:2410.04795, 2024 | | 2024 |
InstaTrans: An Instruction-Aware Translation Framework for Non-English Instruction Datasets Y Kim, C Park arXiv preprint arXiv:2410.01512, 2024 | | 2024 |
1 Trillion Token (1TT) Platform: A Novel Framework for Efficient Data Sharing and Compensation in Large Language Models C Park, H Ha, J Kim, Y Kim, D Kim, S Lee, S Yang arXiv preprint arXiv:2409.20149, 2024 | | 2024 |
Rethinking KenLM: Good and Bad Model Ensembles for Efficient Text Quality Filtering in Large Web Corpora Y Kim, H Ha, S Lee, J Kim, S Yang, C Park arXiv preprint arXiv:2409.09613, 2024 | | 2024 |
SAAS: Solving Ability Amplification Strategy for Enhanced Mathematical Reasoning in Large Language Models H Kim, G Gim, Y Kim, J Kim, B Kim, W Lee, C Park arXiv preprint arXiv:2404.03887, 2024 | | 2024 |
Evaluation of Modality Fusion Methods in Graph Convolutional Network-Based Multimedia Recommendation 김윤기, 김태리, 김상욱 Korea Software Congress, 113-115, 2022 | | 2022 |