• Fisch@discuss.tchncs.de · 4 months ago

    That’s weird, maybe I actually am doing something wrong. Could it be because I’m using GGUF models?

    • Mike1576218@lemmy.ml · 4 months ago

      Llama 2 as GGUF with 2-bit quantisation only needs ~5GB of VRAM, while 8-bit needs >9GB. Anything in between is possible too. There are even 1.5-bit and 1-bit options (not GGUF AFAIK). Generally, fewer bits means worse results though.
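
      The rule of thumb behind those figures is just parameter count × bits per weight; the real footprint is higher because of the KV cache, activations, and per-block quantisation overhead. A minimal back-of-the-envelope sketch (hypothetical helper name, model size picked purely for illustration, not part of any llama.cpp/GGUF tooling):

      ```python
      # Rough lower-bound estimate of the memory needed just to hold quantised weights.
      # Ignores KV cache, activations, and quantisation-block overhead, so real VRAM
      # usage will be noticeably higher.

      def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
          bytes_total = n_params * bits_per_weight / 8
          return bytes_total / (1024 ** 3)

      # Example: a ~13-billion-parameter model at common quantisation levels.
      for bits in (2, 4, 8, 16):
          print(f"{bits}-bit: ~{weight_memory_gib(13e9, bits):.1f} GiB for weights alone")
      ```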