Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
swpresley 's Collections
HensonResearchFor1BitLLMs

HensonResearchFor1BitLLMs

updated Mar 16, 2024
Upvote
-

  • The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Paper • 2402.17764 • Published Feb 27, 2024 • 630

  • Atom: Low-bit Quantization for Efficient and Accurate LLM Serving

    Paper • 2310.19102 • Published Oct 29, 2023 • 11

  • AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning

    Paper • 2311.00257 • Published Nov 1, 2023 • 10

  • BiLLM: Pushing the Limit of Post-Training Quantization for LLMs

    Paper • 2402.04291 • Published Feb 6, 2024 • 50

  • OneBit: Towards Extremely Low-bit Large Language Models

    Paper • 2402.11295 • Published Feb 17, 2024 • 24

  • Improving Text Embeddings with Large Language Models

    Paper • 2401.00368 • Published Dec 31, 2023 • 83

  • LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery

    Paper • 2310.18356 • Published Oct 24, 2023 • 24
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs