mehmetcanbudak 's Collections arXiv
updated
LoRA: Low-Rank Adaptation of Large Language Models
Paper
• 2106.09685
• Published • 60
Attention Is All You Need
Paper
• 1706.03762
• Published • 120
Direct Preference Optimization: Your Language Model is Secretly a Reward
Model
Paper
• 2305.18290
• Published • 64
Lost in the Middle: How Language Models Use Long Contexts
Paper
• 2307.03172
• Published • 44
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Paper
• 2005.11401
• Published • 14
FlashAttention: Fast and Memory-Efficient Exact Attention with
IO-Awareness
Paper
• 2205.14135
• Published • 15
Not All Attention Is All You Need
Paper
• 2104.04692
• Published • 1
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper
• 2307.09288
• Published • 251
Paper
• 2310.06825
• Published • 58
QLoRA: Efficient Finetuning of Quantized LLMs
Paper
• 2305.14314
• Published • 61