[Paper Review] / Natural Language Processing (6)

[Paper Review] DoRA: Weight-Decomposed Low-Rank Adaptation

Paper: https://arxiv.org/abs/2402.09353 Authors: Shih-Yang Liu 1,2, Chien-Yi Wang 1, Hongxu Yin 1, Pavlo Molchanov 1, Yu-Chiang Frank Wang 1, Kwang-Ting Cheng 2, Min-Hung Chen 1 (1: NVIDIA, 2: HKUST) Citation: Liu, Shih-Yang, et al. "Dora: Weight-decomposed low-rank adaptation." arXiv preprint arXiv:2402.09353 (2024). GitHub: https://github.com/nbasyl/DoRA Reference 1: https://discuss.pytorch.kr/t/dora-lora-weight-decomposed..

[Paper Review] Seq2Seq: Sequence to Sequence Learning with Neural Networks

Paper: https://arxiv.org/abs/1409.3215 Authors: Ilya Sutskever - ilyasu@google.com, Oriol Vinyals - vinyals@google.com, Quoc V. Le - qvl@google.com Citation: Sutskever, I. "Sequence to Sequence Learning with Neural Networks." arXiv preprint arXiv:1409.3215 (2014).   0. Abstract: Deep Neural Networks (DNNs) have shown strong performance on many tasks when large labeled datasets are available, but this has not been the case for sentence translation. The authors, making minimal assumptions about language structure, ..

[Paper Review] LSTM: Long Short-Term Memory

Paper: https://ieeexplore.ieee.org/abstract/document/6795963 Authors: Sepp Hochreiter (Fakultät für Informatik, Technische Universität München, 80290 München, Germany), Jürgen Schmidhuber (IDSIA, Corso Elvezia 36, 6900 Lugano, Switzerland) Citation: S. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," in Neural Computation, vol. 9, no. 8, pp. 1735-1780, 15 Nov. 1997, doi: 10.1162/neco.1997.9.8.1735. Reference: 1..

[Paper Review] Attention: Neural Machine Translation by Jointly Learning to Align and Translate

Paper: https://arxiv.org/abs/1409.0473 Authors: Dzmitry Bahdanau (Jacobs University Bremen, Germany), KyungHyun Cho and Yoshua Bengio* (Université de Montréal) *CIFAR Senior Fellow Citation: Bahdanau, Dzmitry. "Neural machine translation by jointly learning to align and translate." arXiv preprint arXiv:1409.0473 (2014). Tutorial code: https://tutorials.pytorch.kr/intermediate/seq2seq_translation_tutorial.html Data..

[Paper Review] LoRA: Low-Rank Adaptation of Large Language Models

Paper: https://arxiv.org/abs/2106.09685 Authors: Edward Hu*, Yelong Shen*, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, Weizhu Chen (Microsoft Corporation) *Equal contribution Citation: Hu, Edward J., et al. "Lora: Low-rank adaptation of large language models." arXiv preprint arXiv:2106.09685 (2021). GitHub: https://github.com/microsoft/LoRA 0. Abstract: The dominant paradigm in NLP is to pre-train a large model on general-domain data and then adapt it to a particular ta..