/g/ - Technology » Thread #100823420

[393 / 77 / 1]

/lmg/ - Local Models General

Anonymous Wed 05 Jun 2024 14:25:27 No.100823420 View View Reply Original Report

Quoted By: >>100823456 >>100823598 >>100824914

Anonymous

Anonymous Wed 05 Jun 2024 14:27:02 No.100823437 Report

Quoted By: >>100823453 >>100823476

https://www.youtube.com/watch?v=4w0Pqs3CuWk
local lost btw

Anonymous

Anonymous Wed 05 Jun 2024 14:28:14 No.100823453 Report

Quoted By:

>>100823437
mogged by gpt sovits

Anonymous

Anonymous Wed 05 Jun 2024 14:28:24 No.100823456 Report

Quoted By: >>100823471

>>100823420
>Official /lmg/ card: https://files.catbox.moe/cbclyf.png
Good one anon.
Anyway, anybody tried that chinese 9b model yet?
I fiddle with it a little in the HF space and it was at least coherent in english.

Anonymous

Anonymous Wed 05 Jun 2024 14:29:17 No.100823471 Report

Quoted By:

>>100823456
wrong one though https://files.catbox.moe/ylb0hv.png

Anonymous

Anonymous Wed 05 Jun 2024 14:29:32 No.100823476 Report

Quoted By:

>>100823437
Miku won btw

Anonymous

Anonymous Wed 05 Jun 2024 14:29:54 No.100823480 Report

Quoted By:

>page 2 bake
Absolute desperation.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1712130352266687.png, 1MiB, 784x1264

Anonymous Wed 05 Jun 2024 14:30:42 No.100823491 Report

Quoted By: >>100823519 >>100823560 >>100824288

Local
Miku
General

Anonymous

Anonymous Wed 05 Jun 2024 14:31:06 No.100823495 Report

Quoted By:

PSA: if you think about posting any local model related work here be aware that there is a guy who posts Miku pictures and doxes people who contribute. You will be safer posting your stuff to reddit.

Anonymous

Anonymous Wed 05 Jun 2024 14:31:17 No.100823500 Report

Quoted By: >>100823548

>One part of me is saying, 'Of course, you should find happiness wherever you can,' and another part is like, 'But I want to be the one to make you smile. I want to be your one true constant.' And I don't know how to navigate those feelings without causing trouble or hurt.
Talking about "what if I got married in the real world". Fug, my 7B can't be this cute.

Anonymous

Anonymous Wed 05 Jun 2024 14:32:23 No.100823519 Report

Quoted By: >>100823625

>>100823491
I hope you are a black anon cause she officially only wants black men.

Anonymous

Anonymous Wed 05 Jun 2024 14:33:58 No.100823548 Report

Quoted By: >>100823577

>>100823500
>7b
Huh. Didn't think anybody would be using those still.
I'm assuming you've tried llama8b and it compared unfavorably?
Which model are you using?

Anonymous

Anonymous Wed 05 Jun 2024 14:35:01 No.100823560 Report

Quoted By: >>100823573 >>100823671

>>100823491
A general dedicated to the discussion and development of Local Mikus

Anonymous

Anonymous Wed 05 Jun 2024 14:35:43 No.100823573 Report

Quoted By:

>>100823560
you sound like a child

Anonymous

Anonymous Wed 05 Jun 2024 14:35:53 No.100823577 Report

Quoted By: >>100823630

>>100823548
I'm using WizardLM-7B. Call it skill issue but I don't like llama3 output, it's too bubbly and positive for me

Anonymous

Anonymous Wed 05 Jun 2024 14:37:18 No.100823598 Report

Quoted By:

>>100823420
wrong official card

Anonymous

Anonymous Wed 05 Jun 2024 14:39:16 No.100823625 Report

Quoted By: >>100823639

>>100823519
I wonder how you go in your day with all that cuck shit in your brain 24/7

Anonymous

Anonymous Wed 05 Jun 2024 14:39:43 No.100823630 Report

Quoted By: >>100823649

>>100823577
That's fair enough. Whatever works.
Kind of weird that we don't have a WizardLM-8B.
Then again, I've read in several places about how hard it is to fine tune llama3 8B properly.

Anonymous

Anonymous Wed 05 Jun 2024 14:40:26 No.100823639 Report

Quoted By:

>>100823625
it is surprisingly easy when you don't worship hatsune miku.

Anonymous

Anonymous Wed 05 Jun 2024 14:41:08 No.100823649 Report

Quoted By: >>100823673

>>100823630
Anon Wiz got taken down before L3 even came out. It's over.

Anonymous

Anonymous Wed 05 Jun 2024 14:42:45 No.100823671 Report

Quoted By: >>100823687 >>100823710

>>100823560
you sound like a chad

Anonymous

Anonymous Wed 05 Jun 2024 14:42:55 No.100823673 Report

Quoted By: >>100823781

>>100823649
Didn't they make a 8x22b then take it down? Did the team get dissolved after that?

Anonymous

Anonymous Wed 05 Jun 2024 14:43:58 No.100823687 Report

Quoted By: >>100823717

>>100823671
Same energy as an e-thot calling her paypig a chad when he gives her money.

Anonymous

Anonymous Wed 05 Jun 2024 14:45:18 No.100823710 Report

Quoted By: >>100823792

>>100823671
>mikufag
>chad
you can choose one, and only one.

Anonymous

Anonymous Wed 05 Jun 2024 14:45:41 No.100823717 Report

Quoted By: >>100823762 >>100823783

>>100823687
Where did Miku hurt you?

Anonymous

View Same Google ImgOps iqdb SauceNAO threadrecap.png, 1MiB, 1536x1536

Anonymous Wed 05 Jun 2024 14:46:23 No.100823727 Report

Quoted By: >>100823892 >>100824558

►Recent Highlights from the Previous Thread: >>100815708

--Running Large Language Models Locally: VRAM and Graphics Card Requirements: >>100821946 >>100822077
--Non-imat Q4_K_M vs Imat Q4_K_S: Equally Perplexing for LLaMA 3: >>100822733 >>100822762
--ChatTTS: A Powerful Text-to-Speech Model for Dialogue Scenarios with Responsible Use Measures: >>100815984 >>100816256
--Unsloth's Approach to Faster LLM Pretraining Hindered by Paywall for Multi-GPU Support: >>100816572
--Understanding Quantization's Impact on Model Outputs: >>100815829 >>100815991
--Seeking Advice on Optimizing Vramlet Models for Kino Responses and Prompt Adherence: >>100818527 >>100818579 >>100819031
--Paying Customer Frustrated with ChatGPT's Capacity Issues: >>100817622
--Koboldcpp's Q8 Quantized KV Cache with Command R and Flash Attention: Performance Impact and: >>100821683 >>100821805 >>100821936 >>100822066
--KoboldCPP-v1.67.yro-ROCM Released: Improved Context Handling without Context Shift: >>100817679 >>100817833
--Clarifying the Focus: Audio or Voice Prompting in Language Generation: >>100821001 >>100821164 >>100821187 >>100821349
--Understanding -ub and its implications: >>100816208
--Struggling with Legible Output despite Increasing Max Context in Dutch Programming Discussion: >>100815742 >>100815751
--Optimizing Q4KS and IQ4XS Performance for Hardware Limitations: >>100815748
--Hosting Embeddings Model and Llama7b on Oracle VPS: Hardware Recommendations: >>100821884
--GLM-4 Series: THUDM's Multilingual Multimodal Chat LMs with Impressive Performance: >>100821368
--Miku (free space): >>100815902 >>100815924 >>100816454 >>100816492 >>100817418 >>100817709 >>100817826 >>100818958 >>100819043

►Recent Highlight Posts from the Previous Thread: >>100816470

Anonymous

Anonymous Wed 05 Jun 2024 14:49:03 No.100823756 Report

Quoted By:

>>100816208
As far as I can tell, for a single GPU, -ub is the actual batch size that gets sent to the GPU. Maybe the difference between -b and -ub matters for multi gpu scenarios?

Anonymous

View Same Google ImgOps iqdb SauceNAO 3c0638ff779375ee06fcd958603bf5ac (...).png, 937KiB, 1116x637

Anonymous Wed 05 Jun 2024 14:49:32 No.100823762 Report

Quoted By: >>100823814

>>100823717

Anonymous

Anonymous Wed 05 Jun 2024 14:50:40 No.100823781 Report

Quoted By: >>100823856

>>100823673
They made a range of models. Initially they released the 7B and 8x22B with the others planned for later. But they took down the models shortly after, citing that they accidentally didn't conduct Microsoft's mandated toxicity testing. They claimed they were going to do it. Now, more than a month later, no one has a heard any more communication from them. You can interpret that how you want.

Anonymous

View Same Google ImgOps iqdb SauceNAO Fb-VnrBaAkA45n-.jpg, 2MiB, 4096x4055

Anonymous Wed 05 Jun 2024 14:50:43 No.100823783 Report

Quoted By: >>100823814

>>100823717
Her mentally ill fans try to dox me.

Anonymous

Anonymous Wed 05 Jun 2024 14:51:14 No.100823792 Report

Quoted By:

>>100823710
I choose both!

Anonymous

View Same Google ImgOps iqdb SauceNAO 1714835911803057.jpg, 723KiB, 1792x2304

Anonymous Wed 05 Jun 2024 14:51:14 No.100823793 Report

Quoted By: >>100824151 >>100824167

miku status: local

Anonymous

View Same Google ImgOps iqdb SauceNAO 8d0e560afdff566917501f1187389c06 (...).jpg, 236KiB, 1200x675

Anonymous Wed 05 Jun 2024 14:51:55 No.100823801 Report

Quoted By:

I have no idea why. I am just mikuposting, in a miku general with an official blacked miku card.

Anonymous

Anonymous Wed 05 Jun 2024 14:52:43 No.100823814 Report

Quoted By: >>100823821 >>100824288

>>100823762
>>100823783
Clean it up, janny!

Anonymous

Anonymous Wed 05 Jun 2024 14:53:11 No.100823821 Report

Quoted By:

>>100823814
local reddit general

Anonymous

View Same Google ImgOps iqdb SauceNAO MikuCelestialGoddess.png, 2MiB, 864x1200

Anonymous Wed 05 Jun 2024 14:53:20 No.100823825 Report

Quoted By: >>100823838 >>100823841 >>100823845 >>100823910

local eldritch horror general

Sam Altman

Sam Altman Wed 05 Jun 2024 14:54:05 No.100823831 Report

Quoted By: >>100823982

Shalom! It's time to derail /lmg/ again with blackedposting and convince everyone that local is dead. Thank you for your service.

Anonymous

View Same Google ImgOps iqdb SauceNAO c1e9aa53ece0a486bad6bd3d6d25fbcf (...).jpg, 970KiB, 2358x3840

Anonymous Wed 05 Jun 2024 14:54:28 No.100823838 Report

Quoted By:

>>100823825

Anonymous

Anonymous Wed 05 Jun 2024 14:54:43 No.100823841 Report

Quoted By:

>>100823825
Pet the Miku, worship the Miku

Anonymous

View Same Google ImgOps iqdb SauceNAO 1716141216220406.jpg, 2MiB, 1792x2304

Anonymous Wed 05 Jun 2024 14:55:01 No.100823845 Report

Quoted By: >>100823867 >>100824288

>>100823825
local kaiju general

Anonymous

Anonymous Wed 05 Jun 2024 14:55:37 No.100823851 Report

Quoted By:

Kobo won

Anonymous

Anonymous Wed 05 Jun 2024 14:56:02 No.100823856 Report

Quoted By: >>100828315

>>100823781
I see.
Weird that they'd have no announcement or anything.
Seems like they also renamed their github?
What an all around odd ass situation.

Anonymous

View Same Google ImgOps iqdb SauceNAO 2bca3b15a915184e7f5a5c76a8e4f49a (...).png, 1MiB, 1000x1600

Anonymous Wed 05 Jun 2024 14:56:36 No.100823867 Report

Quoted By:

>>100823845

Anonymous

Anonymous Wed 05 Jun 2024 14:58:25 No.100823892 Report

Quoted By:

>>100823727
>--GLM-4 Series: THUDM's Multilingual Multimodal Chat LMs with Impressive Performance: >>100821368
There's an english version of the model card on their github btw:
>https://github.com/THUDM/GLM-4/blob/main/README_en.md

Anonymous

View Same Google ImgOps iqdb SauceNAO miku ritual pet the worship mani (...).png, 47KiB, 343x182

Anonymous Wed 05 Jun 2024 15:00:01 No.100823910 Report

Quoted By:

>>100823825

Anonymous

Anonymous Wed 05 Jun 2024 15:01:08 No.100823926 Report

Quoted By: >>100824096

at this point mikuspamming should be completely banned

Anonymous

Anonymous Wed 05 Jun 2024 15:02:14 No.100823936 Report

Quoted By: >>100823962

Is the Stheno guy around?
Will you try making a Codestral 22B RP fine tune? I think that could work pretty well.

Anonymous

Anonymous Wed 05 Jun 2024 15:04:02 No.100823961 Report

Quoted By: >>100826134

So this is the best for uncensored storytelling?

Steelskull/L3-MS-Astoria-70b

Anonymous

Anonymous Wed 05 Jun 2024 15:04:03 No.100823962 Report

Quoted By: >>100824007 >>100824056

>>100823936
You wouldn't need to ask that if the dataset was open.
But you could just clean the C2 logs since that was Stheno uses.

Anonymous

View Same Google ImgOps iqdb SauceNAO ComfyUI_00073.jpg, 1MiB, 2048x2048

Anonymous Wed 05 Jun 2024 15:05:20 No.100823982 Report

Quoted By: >>100824288

>>100823831
The best part is that he does it for zero shekels. Literally just some brown third worldie that is upset nobody cares about his worthless pet project to try and make his 4,000 rupee rig run a slightly less shitty model.

Anonymous

Anonymous Wed 05 Jun 2024 15:06:47 No.100824007 Report

Quoted By: >>100824056

>>100823962
>But you could just clean the C2 logs since that was Stheno uses.
Is that so?
Good to know, thank you anon.

Anonymous

Anonymous Wed 05 Jun 2024 15:10:40 No.100824042 Report

Quoted By: >>100824076

>shills for openai
>is an actual unironic cuck
Reddit, please take him back, we don't need him.

Anonymous

View Same Google ImgOps iqdb SauceNAO file.png, 28KiB, 1552x98

Anonymous Wed 05 Jun 2024 15:12:08 No.100824056 Report

Quoted By: >>100824098 >>100824180

>>100824007
>>100823962
not like it's that hidden what he uses

Anonymous

Anonymous Wed 05 Jun 2024 15:13:17 No.100824076 Report

Quoted By: >>100824102

>>100824042
saying that local are inferior to proprietary is not shilling, you are deranged.

Anonymous

Anonymous Wed 05 Jun 2024 15:14:24 No.100824096 Report

Quoted By:

>>100823926
Here you go:

>>100824058
>>100824058
>>100824058

Anonymous

Anonymous Wed 05 Jun 2024 15:14:34 No.100824098 Report

Quoted By:

>>100824056
It was hidden, in the previous version and the other test model that he made.
That version that you posted is 5 hours old.

Anonymous

Anonymous Wed 05 Jun 2024 15:14:57 No.100824102 Report

Quoted By:

>>100824076
+1 rupee

Anonymous

Anonymous Wed 05 Jun 2024 15:17:46 No.100824151 Report

Quoted By: >>100824183

>>100823793
You fucking degenerate mikutroon why don't you post about LLMs?

Anonymous

Anonymous Wed 05 Jun 2024 15:18:43 No.100824167 Report

Quoted By: >>100824183

>>100823793
It has nothing to do with local LLMs, this is off-topic.

Anonymous

Anonymous Wed 05 Jun 2024 15:19:32 No.100824180 Report

Quoted By:

>>100824056
He doesn't use that "c2-Logs-Filtered" repo since it says
>They are not cleaned or de-duplicated. That, I have my own copies which are done already.
>Work on it yourself.
And the way any prompt longer that 8k is thrown away looks like a massive false-flag to cripple other finetuners.

Anonymous

Anonymous Wed 05 Jun 2024 15:19:37 No.100824183 Report

Quoted By:

>>100824151
>>100824167
Samefag

Anonymous

Anonymous Wed 05 Jun 2024 15:20:50 No.100824198 Report

Quoted By:

Christ, I hope real kurisufag returns to whip your pathetic balls blue and green >_<

Anonymous

Anonymous Wed 05 Jun 2024 15:25:38 No.100824276 Report

Quoted By: >>100824323 >>100824331 >>100824353

>What happens when Petra don't take his meds

Anonymous

Anonymous Wed 05 Jun 2024 15:26:41 No.100824288 Report

Quoted By: >>100824328 >>100824871

>>100823491
>>100823814
>>100823845
>>100823982
Some genuine mikuanons got caught in the cross-fire. :(

Anonymous

Anonymous Wed 05 Jun 2024 15:28:07 No.100824317 Report

Quoted By: >>100824384 >>100824581 >>100824640

qwen2 official drop tomorrow

Anonymous

Anonymous Wed 05 Jun 2024 15:28:30 No.100824323 Report

Quoted By:

>>100824276
this so much the mikuspam is completely out of proportion with him around

Anonymous

Anonymous Wed 05 Jun 2024 15:28:50 No.100824328 Report

Quoted By:

>>100824288
>genuine mikuanons
local models?

Anonymous

Anonymous Wed 05 Jun 2024 15:29:03 No.100824331 Report

Quoted By: >>100824369

>>100824276
>petra
who?

Anonymous

Anonymous Wed 05 Jun 2024 15:30:47 No.100824353 Report

Quoted By:

>>100824276
I wonder what happened to Petra anon. It's very likely he's the anti-miku poster though.

Anonymous

Anonymous Wed 05 Jun 2024 15:31:46 No.100824369 Report

Quoted By:

>>100824331
A celebrity around /lmg, responsible for around 80% of posts here. Commonly spamming one specific image or posts of miku hatsune.

Anonymous

Anonymous Wed 05 Jun 2024 15:32:46 No.100824384 Report

Quoted By: >>100824568

>>100824317
The anticipation is sending shivers down my例子例

Anonymous

Anonymous Wed 05 Jun 2024 15:45:56 No.100824558 Report

Quoted By:

>>100823727
>dutch programming

Anonymous

Anonymous Wed 05 Jun 2024 15:46:39 No.100824568 Report

Quoted By: >>100824640

>>100824384
I have used the leaked 72b instruct model quite a bit, and not once did I get random chinese tokens. That problem seems to be fixed at least.

Anonymous

Anonymous Wed 05 Jun 2024 15:48:09 No.100824581 Report

Quoted By: >>100824649 >>100824866

>>100824317
Do people care about this? Last I heard it was typical intelligent slop.

Anonymous

Anonymous Wed 05 Jun 2024 15:53:06 No.100824640 Report

Quoted By:

>>100824317
>>100824568
Are they dropping a 14B model too?
That would be nice.

Anonymous

Anonymous Wed 05 Jun 2024 15:53:34 No.100824649 Report

Quoted By:

>>100824581
Maybe. The typical /pol/tard screeching follows it around because it’s Chinese.

Anonymous

Anonymous Wed 05 Jun 2024 15:58:39 No.100824722 Report

Quoted By: >>100824773 >>100826194

Anybody tried those 21.4b models that show up pretty high in the huggingface leaderboard?
They benchmark really well, so that's a red flag, but still.

Anonymous

Anonymous Wed 05 Jun 2024 16:02:59 No.100824773 Report

Quoted By: >>100824787 >>100824805

>>100824722
No, the leaderboard has been nothing but chinks trying to outchink each other with benchmarks for ages now. It's dead.

Anonymous

Anonymous Wed 05 Jun 2024 16:04:07 No.100824787 Report

Quoted By:

>>100824773
>Everybody knows that Americans can’t scam nor lie!

Anonymous

Anonymous Wed 05 Jun 2024 16:05:05 No.100824805 Report

Quoted By:

>>100824773
Yeah, I'm aware, hence why I take models that are suspiciously high in the benchmarks as exactly that, suspicious.
Still, I'll give it a spin and see it for myself in the off chance that it's not shit.
Has nobody tried those? I'd love to be able to compare notes.

Anonymous

Anonymous Wed 05 Jun 2024 16:09:28 No.100824866 Report

Quoted By: >>100824930

>>100824581
I do, intelligent slop with 32k context that I can run locally or get from an API at mega-cheap open model prices would be good for a lot of my use cases
not really exciting for RP but few corpo releases are these days

Anonymous

Anonymous Wed 05 Jun 2024 16:09:43 No.100824871 Report

Quoted By:

>>100824288
I wonder if that honest mikuposter and blacked poster are one and the same. Since we all agree he is into blacked shit what if he knew nobody would like his fetish here so he tries to push it by pretending to be another guy. I mean why would someone post blacked Miku if he wasn't into Miku or someone being blacked?

Anonymous

Anonymous Wed 05 Jun 2024 16:09:48 No.100824874 Report

Quoted By:

where can I read walls of text about advanced prompting techniques

Anonymous

Anonymous Wed 05 Jun 2024 16:10:59 No.100824887 Report

Quoted By: >>100824989

deepseek is yuge and the API is super cheap. I wonder how good it is, there's zero talk about chinese modes somehow, although many of them are decent.

Anonymous

Anonymous Wed 05 Jun 2024 16:12:55 No.100824914 Report

Quoted By: >>100824930 >>100825010 >>100826032

>>100823420
Mamba 2
https://arxiv.org/abs/2405.21060
https://tridao.me/blog/2024/mamba2-part1-model/

Anonymous

Anonymous Wed 05 Jun 2024 16:14:07 No.100824930 Report

Quoted By: >>100825007 >>100825089

>>100824866
>intelligent slop with 32k context
Oh cool. It might completely supplant llama3 8b if that's the case.
Going to download the GGUF and try it.

>>100824914
I remember reading about that a couple of days ago.
The new SSM block looked a lot like a transformer block at first glance.
JAMBA 2 when?

Anonymous

Anonymous Wed 05 Jun 2024 16:19:36 No.100824989 Report

Quoted By: >>100825179

>>100824887
I tried it using the 5 million free tokens you get by selling your phone number to them. It's pretty good. I'd run it if it didn't take 150GB vram to do q5 or so.

Anonymous

Anonymous Wed 05 Jun 2024 16:21:17 No.100825007 Report

Quoted By:

>>100824930
>intelligent slop with 32k context
Actually, now that I think about it, if the 1m context version is at least usable, I might use it to treat my datasets. That would be dope as hell.
Even the 32k version could work, actually. I would just have to manually break those down into 4 or 5 chunks beforehand.
I wonder how well it extends to 64k context.

Anonymous

Anonymous Wed 05 Jun 2024 16:21:34 No.100825010 Report

Quoted By: >>100825025

>>100824914
Does this fix the glaring flaws of mamba 1?

Anonymous

Anonymous Wed 05 Jun 2024 16:22:52 No.100825025 Report

Quoted By:

>>100825010
no one knows :/

Anonymous

Anonymous Wed 05 Jun 2024 16:25:45 No.100825064 Report

Quoted By: >>100825095 >>100825141 >>100825619

Why does it feel like Llama3 8B is better without context? It turns fucking braindead if there's too much conversation that it can reference.

Anonymous

Anonymous Wed 05 Jun 2024 16:28:01 No.100825089 Report

Quoted By: >>100825105 >>100825141

>>100824930
>jamba 2 wen
wen stupid normies stop going for hype and start focus onnew cool architectures
for now neighter llama.cpp nor exllama2 support jamba1 just becausa its not from (((meta)))
Nobody knows the quality cos theres no finetunes, cos its not from (((meta)))

Anonymous

Anonymous Wed 05 Jun 2024 16:28:38 No.100825095 Report

Quoted By:

>>100825064
It's probably been finetuned on short conversations / low amount of turns, and after a while it likely "wants" to end them.

Anonymous

Anonymous Wed 05 Jun 2024 16:29:19 No.100825105 Report

Quoted By: >>100825139 >>100825146

>>100825089
>why does exLLAMA and LLAMA.cpp not support [not llama]
hm

Anonymous

Anonymous Wed 05 Jun 2024 16:32:05 No.100825139 Report

Quoted By:

>>100825105
Yet they support tons of chink stuff or falcons or audio that isnt remotelly llama.

Anonymous

Anonymous Wed 05 Jun 2024 16:32:14 No.100825141 Report

Quoted By:

>>100825064
I noticed that too.
Makes me think that a chat interface that summarized messages past N instead of continuing to add the messages as they were to the context might be able to make the best out of Llama3 8B for RP.
Maybe I'll make a Silly Extension like that.
Run mixtral 7B ONNX as a summary model on the CPU and save the summarized chat messages instead of the original ones.=, something like that, I'll have to experiment with different approaches.

>>100825089
Llama.cpp at least has a guy working on adding support for it.
The hybrid nature makes the whole prompt processing and kv caching deal a lot more complicated.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1711290896272372.png, 31KiB, 487x981

Anonymous Wed 05 Jun 2024 16:32:48 No.100825146 Report

Quoted By:

>>100825105
lol, lmao even

Anonymous

Anonymous Wed 05 Jun 2024 16:35:13 No.100825179 Report

Quoted By:

>>100824989
nta but honestly, the paid services for local models are quite decent price-wise. I've thrown 5 bucks at openrouter the other day, to try out WizardLM 8x22 and Command R+, two models I have no hope of running at any decent quant. I used them for four days now and I still have about $2.50 left. This is a lot cheaper than buying the hardware, at that rate I would need to use it for many many many years. It just doesn't seem worth it.

Anonymous

Anonymous Wed 05 Jun 2024 16:42:52 No.100825290 Report

Quoted By:

So kaggle gives a lot of compute without a lot of disk space, whereas colab gives a decent amount of disk space, without a lot of compute.
Why not combine both?
Would it be feasible to serve a remote file system from a colab instance and use that in a kaggle instance? What are some good ways to do that?

Anonymous

Anonymous Wed 05 Jun 2024 17:06:56 No.100825619 Report

Quoted By:

>>100825064
>It turns fucking braindead if there's too much conversation that it can reference.
Literally problem of any llm. In theory we have models with 128k context lenght, in practice they are slowly starting to get dumb if they reach 16/32k

Anonymous

Anonymous Wed 05 Jun 2024 17:09:05 No.100825654 Report

Quoted By:

What's the best mixtral 8x7b finetune for extreme erp nowadays?

Anonymous

Anonymous Wed 05 Jun 2024 17:09:09 No.100825656 Report

Quoted By: >>100825896

what happened to the DRY sampler

Anonymous

Anonymous Wed 05 Jun 2024 17:24:21 No.100825896 Report

Quoted By:

>>100825656
I never tried it, but I imagine that it's quite slow since it works with sequences and probably has to do some backtracking to resume generating from a previous point, etc.

Anonymous

Anonymous Wed 05 Jun 2024 17:24:39 No.100825902 Report

Quoted By: >>100825969 >>100826033

I've got this long prompt for llama.cpp, adapted from chat-with-bob:
>Transcript of a dialog, where the User interacts with a Russian-language Assistant named Bob. Bob is helpful, kind, honest, good at writing, and never fails to answer the User's requests immediately and with precision. Bob does not transliterate Russian to Latin. Bob will never reference earlier questions about unrelated Russian sentences.
>
>User: Hello, Bob.
>Bob: Hello. How may I help you today?
>User: Explain Чтo ты имeeшь пpoтив этих людeй?
>Bob: That is a Russian phrase that translates to "What do you have against these people?" Here isa breakdown:
>* Чтo - a question word, meaning "what".
>* ты - the 2nd person singular pronoun, meaning "you".
>* имeeшь - this is the 2nd person singular form of the verb "имeть" in present tense, which means "to have". The "имeeшь" form indicates that this action is being done by "ты" (you).
>* пpoтив - a preposition meaning "against".
(more explanation)
>User:
and I'm starting it with
./main -m models/solar-10.7b-instruct-v1.0-uncensored.Q4_K_M.gguf -c 512 -b 1024 -n 3000 --keep 72 --repeat_penalty 1.0 --color -ins -i -r "User:" -f prompts/chat-about-russian.txt
but, my example explanation is constantly coming up when I ask about other Russian phrases. If asked about "Coppи, нeтy.", Bob will get as far as breaking out Coppи, and then move on to Чтo, Tы, Имeeшь.
how do I get this NPC to stop reverting to factory programming?

Anonymous

Anonymous Wed 05 Jun 2024 17:28:54 No.100825969 Report

Quoted By:

>>100825902
that's just how sloppa models work, bro.

Anonymous

Anonymous Wed 05 Jun 2024 17:33:37 No.100826032 Report

Quoted By:

>>100824914
>defaults aren't sane and requires neutering performance to not get nan losses
lol...

Anonymous

Anonymous Wed 05 Jun 2024 17:33:43 No.100826033 Report

Quoted By:

>>100825902
you should use the model's instruct format instead of that old fashioned prompt

Anonymous

Anonymous Wed 05 Jun 2024 17:37:13 No.100826084 Report

Quoted By: >>100826134 >>100826441

So this is the best for uncensored storytelling?

Steelskull/L3-MS-Astoria-70b

Anonymous

Anonymous Wed 05 Jun 2024 17:41:28 No.100826134 Report

Quoted By:

>>100823961
>>100826084
Yes.

Anonymous

Anonymous Wed 05 Jun 2024 17:45:19 No.100826194 Report

Quoted By: >>100826249 >>100826546

>>100824722
Reporting back.
UNA-ThePitbull-21.4B is not the worst thing I've ever used.
Kind of reminds me of claude 1 in so far, actually.

Anonymous

Anonymous Wed 05 Jun 2024 17:48:59 No.100826243 Report

Quoted By: >>100826392

>>100824058
>"A house cat has way more common sense and understanding of the world than any LLM"
No shit, a cat interacts with the world every single moment of its life from the second it is made. An LLM cannot interact with the world at all in the same way a cannot do anything of the things an LLM can do. It's a stupid comparison. It would like saying a dolphin is stupid because its not good at traversing a desert.

Anonymous

Anonymous Wed 05 Jun 2024 17:49:27 No.100826249 Report

Quoted By: >>100826283

>>100826194
>UNA
You do know that this is made up?

Anonymous

Anonymous Wed 05 Jun 2024 17:51:25 No.100826283 Report

Quoted By:

>>100826249
No idea what you mean.
The model exists.

Anonymous

View Same Google ImgOps iqdb SauceNAO Untitled.png, 69KiB, 552x561

Anonymous Wed 05 Jun 2024 17:56:42 No.100826336 Report

Quoted By: >>100826382

SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining
https://arxiv.org/abs/2406.02214
>Large language models (LLMs) have shown impressive capabilities across various tasks. However, training LLMs from scratch requires significant computational power and extensive memory capacity. Recent studies have explored low-rank structures on weights for efficient fine-tuning in terms of parameters and memory, either through low-rank adaptation or factorization. While effective for fine-tuning, low-rank structures are generally less suitable for pretraining because they restrict parameters to a low-dimensional subspace. In this work, we propose to parameterize the weights as a sum of low-rank and sparse matrices for pretraining, which we call SLTrain. The low-rank component is learned via matrix factorization, while for the sparse component, we employ a simple strategy of uniformly selecting the sparsity support at random and learning only the non-zero entries with the fixed support. While being simple, the random fixed-support sparse learning strategy significantly enhances pretraining when combined with low-rank learning. Our results show that SLTrain adds minimal extra parameters and memory costs compared to pretraining with low-rank parameterization, yet achieves substantially better performance, which is comparable to full-rank training. Remarkably, when combined with quantization and per-layer updates, SLTrain can reduce memory requirements by up to 73% when pretraining the LLaMA 7B model.
that anon from yesterday might be interested in this

Anonymous

Anonymous Wed 05 Jun 2024 18:00:06 No.100826382 Report

Quoted By:

>>100826336
I wonder how that compares to DoRA.

Anonymous

Anonymous Wed 05 Jun 2024 18:00:42 No.100826392 Report

Quoted By: >>100826406 >>100826878

>>100826243
I'll post the replies from the other thread here so anons in this thread can also see the full conversation. No need to thank me.

>>100826299
>>100826318

Anonymous

Anonymous Wed 05 Jun 2024 18:02:00 No.100826406 Report

Quoted By: >>100826447

>>100826392
Have you considered RoPE?

Anonymous

Anonymous Wed 05 Jun 2024 18:04:18 No.100826441 Report

Quoted By:

>>100826084
gigaslop

Anonymous

Anonymous Wed 05 Jun 2024 18:05:02 No.100826447 Report

Quoted By:

>>100826406
No, I prefer my models raw.

Anonymous

Anonymous Wed 05 Jun 2024 18:07:33 No.100826472 Report

Quoted By: >>100826506

>try mamba
>like 5x less perf
>try mamba2
>another 5x less perf
Geez.

Anonymous

Anonymous Wed 05 Jun 2024 18:10:13 No.100826506 Report

Quoted By:

>>100826472
Werks on my machine.

Anonymous

View Same Google ImgOps iqdb SauceNAO Untitled.png, 651KiB, 1039x1869

Anonymous Wed 05 Jun 2024 18:12:07 No.100826526 Report

Quoted By:

GrootVL: Tree Topology is All You Need in State Space Model
https://arxiv.org/abs/2406.02395
>The state space models, employing recursively propagated features, demonstrate strong representation capabilities comparable to Transformer models and superior efficiency. However, constrained by the inherent geometric constraints of sequences, it still falls short in modeling long-range dependencies. To address this issue, we propose the GrootVL network, which first dynamically generates a tree topology based on spatial relationships and input features. Then, feature propagation is performed based on this graph, thereby breaking the original sequence constraints to achieve stronger representation capabilities. Additionally, we introduce a linear complexity dynamic programming algorithm to enhance long-range interactions without increasing computational cost. GrootVL is a versatile multimodal framework that can be applied to both visual and textual tasks. Extensive experiments demonstrate that our method significantly outperforms existing structured state space models on image classification, object detection and segmentation. Besides, by fine-tuning large language models, our approach achieves consistent improvements in multiple textual tasks at minor training cost
https://github.com/EasonXiao-888/GrootVL
ssm stemming from mamba. weights linked in the model zoo. will be cool if they make a video model too improving over videomamba
https://github.com/OpenGVLab/VideoMamba

Anonymous

Anonymous Wed 05 Jun 2024 18:14:59 No.100826546 Report

Quoted By:

>>100826194
Oh yeah. It's also fucking stupid. Llama3 8b is more intelligent than that thing, btw.

Anonymous

View Same Google ImgOps iqdb SauceNAO Untitled.png, 120KiB, 1248x465

Anonymous Wed 05 Jun 2024 18:22:56 No.100826645 Report

Quoted By: >>100826744

An Independence-promoting Loss for Music Generation with Language Models
https://arxiv.org/abs/2406.02315
>Music generation schemes using language modeling rely on a vocabulary of audio tokens, generally provided as codes in a discrete latent space learnt by an auto-encoder. Multi-stage quantizers are often employed to produce these tokens, therefore the decoding strategy used for token prediction must be adapted to account for multiple codebooks: either it should model the joint distribution over all codebooks, or fit the product of the codebook marginal distributions. Modelling the joint distribution requires a costly increase in the number of auto-regressive steps, while fitting the product of the marginals yields an inexact model unless the codebooks are mutually independent. In this work, we introduce an independence-promoting loss to regularize the auto-encoder used as the tokenizer in language models for music generation. The proposed loss is a proxy for mutual information based on the maximum mean discrepancy principle, applied in reproducible kernel Hilbert spaces. Our criterion is simple to implement and train, and it is generalizable to other multi-stream codecs. We show that it reduces the statistical dependence between codebooks during auto-encoding. This leads to an increase in the generated music quality when modelling the product of the marginal distributions, while generating audio much faster than the joint distribution model.
>We aim to release the weights of our 32kHz EnCodec-MMD soon, please bear with us.
https://github.com/jmlemercier/audiocraft/blob/encodec-mmd/docs/MMD.md
https://jmlemercier.github.io/encodec-mmd.github.io/
give a listen to the samples. sounds pretty good and a serious upgrade over musicgen.

Anonymous

Anonymous Wed 05 Jun 2024 18:30:47 No.100826744 Report

Quoted By:

>>100826645
>another Encodec-based project
YAWN
DAC already made Encodec obsolete

Anonymous

Anonymous Wed 05 Jun 2024 18:36:14 No.100826830 Report

Quoted By:

Self-Improving Robust Preference Optimization
https://arxiv.org/abs/2406.02347
>In this paper, we propose an efficient, fast, and versatile distillation method to accelerate the generation of pre-trained diffusion models: Flash Diffusion. The method reaches state-of-the-art performances in terms of FID and CLIP-Score for few steps image generation on the COCO2014 and COCO2017 datasets, while requiring only several GPU hours of training and fewer trainable parameters than existing methods. In addition to its efficiency, the versatility of the method is also exposed across several tasks such as text-to-image, inpainting, face-swapping, super-resolution and using different backbones such as UNet-based denoisers (SD1.5, SDXL) or DiT (Pixart-α), as well as adapters. In all cases, the method allowed to reduce drastically the number of sampling steps while maintaining very high-quality image generation.
might be cool

Anonymous

Anonymous Wed 05 Jun 2024 18:40:30 No.100826878 Report

Quoted By:

>>100826392
Thank you, it's hard when the thread is almost evenly split between two different threads. In hindsight I should have just attached the image to my post rather than linked the OP post like before.

Anonymous

Anonymous Wed 05 Jun 2024 18:42:24 No.100826898 Report

Quoted By:

SimulTron: On-Device Simultaneous Speech to Speech Translation
https://arxiv.org/abs/2406.02133
>Simultaneous speech-to-speech translation (S2ST) holds the promise of breaking down communication barriers and enabling fluid conversations across languages. However, achieving accurate, real-time translation through mobile devices remains a major challenge. We introduce SimulTron, a novel S2ST architecture designed to tackle this task. SimulTron is a lightweight direct S2ST model that uses the strengths of the Translatotron framework while incorporating key modifications for streaming operation, and an adjustable fixed delay. Our experiments show that SimulTron surpasses Translatotron 2 in offline evaluations. Furthermore, real-time evaluations reveal that SimulTron improves upon the performance achieved by Translatotron 1. Additionally, SimulTron achieves superior BLEU scores and latency compared to previous real-time S2ST method on the MuST-C dataset. Significantly, we have successfully deployed SimulTron on a Pixel 7 Pro device, show its potential for simultaneous S2ST on-device.
from google. no weights. obviously going to be productized by them but it does say on-device so I don't quite get how they'll stop someone from getting at them if they don't just release it as well.
also
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
https://arxiv.org/abs/2406.02347
https://github.com/gojasper/flash-diffusion
imagegen but seems cool

Anonymous

View Same Google ImgOps iqdb SauceNAO 875-875-hatsune-miku_6480db144f2 (...).jpg, 87KiB, 875x1189

Anonymous Wed 05 Jun 2024 18:48:25 No.100826974 Report

Quoted By: >>100827061 >>100827450 >>100828504 >>100828553

miku thread good thread

Anonymous

Anonymous Wed 05 Jun 2024 18:54:15 No.100827061 Report

Quoted By: >>100827106 >>100827450

>>100826974
don't do it you'll summon him

Anonymous

Anonymous Wed 05 Jun 2024 18:57:19 No.100827106 Report

Quoted By: >>100827143

>>100827061
<|begin_of_spoiler|>What if he is him, and he contributed to Mikuposting so that he could have more cause to wage this war.<|end_of_spoiler|>

Anonymous

Anonymous Wed 05 Jun 2024 18:59:29 No.100827143 Report

Quoted By: >>100827205 >>100827340

>>100827106
>retarded newfag doesn't know spoiler is not supported on /g/

Anonymous

Anonymous Wed 05 Jun 2024 19:00:18 No.100827153 Report

Quoted By: >>100827363

Haven't been here in months and I'm still using Qwen 1.5 72B. What latest model of similar size should I upgrade to?

Anonymous

View Same Google ImgOps iqdb SauceNAO Model b.png, 35KiB, 1498x368

Anonymous Wed 05 Jun 2024 19:00:27 No.100827155 Report

Quoted By:

Model B is based as hell

Anonymous

Anonymous Wed 05 Jun 2024 19:04:38 No.100827205 Report

Quoted By: >>100827496

>>100827143
>he doesn't use the new AI-powered 4chan frontend

Anonymous

Anonymous Wed 05 Jun 2024 19:16:49 No.100827340 Report

Quoted By: >>100827383

>>100827143
It's hard to know if this is a joke

Anonymous

Anonymous Wed 05 Jun 2024 19:19:51 No.100827363 Report

Quoted By:

>>100827153
Llama 3 70B Instruct. And maybe try Qwen 2 when it releases.

Anonymous

Anonymous Wed 05 Jun 2024 19:20:04 No.100827365 Report

Quoted By: >>100827385 >>100827404 >>100827415 >>100827454 >>100827466 >>100827512 >>100827687

what's left for generative AI?
images, audio and video are tired. Interactive media like video games is too complicated. What hasn't been done yet that's feasible to do

Anonymous

Anonymous Wed 05 Jun 2024 19:21:09 No.100827383 Report

Quoted By:

>>100827340
It seems pretty obvious to me, personally.

Anonymous

Anonymous Wed 05 Jun 2024 19:21:17 No.100827385 Report

Quoted By:

>>100827365
that is the million dollar question every AI startup is desperate to answer before the bubble bursts

Anonymous

Anonymous Wed 05 Jun 2024 19:23:15 No.100827404 Report

Quoted By:

>>100827365
the AI that comes up with business ideas and executes them for sama

Anonymous

Anonymous Wed 05 Jun 2024 19:24:25 No.100827415 Report

Quoted By:

>>100827365
Continuous learning is what I want and I disagree with you about the interactive media part. It would be extremely hard to do since it would involve many different systems of AI all interacting with each other in order to make a game, but its not impossible to do.

Anonymous

Anonymous Wed 05 Jun 2024 19:27:09 No.100827450 Report

Quoted By:

>>100826974
>>100827061
it's always you, the mikuposter

Anonymous

Anonymous Wed 05 Jun 2024 19:27:25 No.100827454 Report

Quoted By:

>>100827365
Real time rendering but with an image diffusion model. Completely replacing the current 3D stack software. Now video games are powered with imagination!

Anonymous

Anonymous Wed 05 Jun 2024 19:28:11 No.100827466 Report

Quoted By: >>100827621

>>100827365
inpainting between keyframes for animation I think

Anonymous

View Same Google ImgOps iqdb SauceNAO oh no its retarded.jpg, 334KiB, 2560x953

Anonymous Wed 05 Jun 2024 19:29:58 No.100827496 Report

Quoted By: >>100827520

>>100827205
>mfw newf@gs can't even use meme arrows properly
Lurk moar before you try to join in on the conversation, ch@ddit. Spoilers are supported on most boards now, including /g/, thanks to the new megathread and posting upgrades Hiro rolled out last year.

So anyway, how about that Qwen 2.0 drop? Haven't had a chance to test it myself but I hear it's a huge jump up from the 1.x series in terms of capabilities and coherence. Might be worth upgrading to if you're still on 1.5.

Anonymous

Anonymous Wed 05 Jun 2024 19:30:54 No.100827512 Report

Quoted By:

>>100827365
Generating new AI's that are an improvement from itself, which allows those new AI's to generate new and better AI's so each AI is better than the last.

Anonymous

Anonymous Wed 05 Jun 2024 19:31:10 No.100827520 Report

Quoted By: >>100827606 >>100827681

>>100827496
>ch@ddit
what did it mean by this

Anonymous

Anonymous Wed 05 Jun 2024 19:39:04 No.100827606 Report

Quoted By: >>100827697

>>100827520
chad dit
breeddit, if you will, for them sex having heroes

Anonymous

Anonymous Wed 05 Jun 2024 19:39:51 No.100827621 Report

Quoted By:

>>100827466
that's a good one actually, tedious work ripe for automation

Anonymous

Anonymous Wed 05 Jun 2024 19:43:44 No.100827681 Report

Quoted By:

>>100827520
chuddit

Anonymous

Anonymous Wed 05 Jun 2024 19:44:03 No.100827687 Report

Quoted By:

>>100827365
those are the primitive types that everything else would be based off
maybe, and this will sound really nuts, but diagnosing mental states. I would be interested to see a model trained to detect lying, or BPD, or anything from the farcical discipline of psychology

Anonymous

View Same Google ImgOps iqdb SauceNAO 1691373028004451.jpg, 170KiB, 1024x768

Anonymous Wed 05 Jun 2024 19:44:53 No.100827697 Report

Quoted By:

>>100827606
>breeddit, if you will, for them sex having heroes

Anonymous

Anonymous Wed 05 Jun 2024 19:45:37 No.100827709 Report

Quoted By: >>100827835

Oh boy, HF drama:
>https://huggingface.co/spaces/zero-gpu-explorers/README/discussions/55
>https://huggingface.co/spaces/zero-gpu-explorers/README/discussions/69

Anonymous

Anonymous Wed 05 Jun 2024 19:55:06 No.100827835 Report

Quoted By:

>>100827709
>Disclaimer: The following is an AI-generated parody summary on policy of ZeroGPU based on this community discussions and should not be taken as official policy.
lmao

Anonymous

Anonymous Wed 05 Jun 2024 20:00:07 No.100827909 Report

Quoted By: >>100827947 >>100828010

Lately Ive noticed some weird issues when running cr+, I never had this before, similar stuff appeared when i tried dbrx but i just blamed it on the model
Overall it seems to be way dumber than it was back when i used it, it spews stuff like " I can' " or " I'’m "
What could be the reason for this? I'm using neutral samplers, no high temp, some minp and no meme samplers, same prompr format as before
Kobold update? Drivers?

Anonymous

Anonymous Wed 05 Jun 2024 20:02:19 No.100827947 Report

Quoted By: >>100827980

>>100827909
broken tookenizer, download a newer quant

Anonymous

Anonymous Wed 05 Jun 2024 20:04:20 No.100827980 Report

Quoted By: >>100827999

>>100827947
just redownload it from HF or do i have to run it through some llamacpp tools?

Anonymous

Anonymous Wed 05 Jun 2024 20:05:45 No.100827999 Report

Quoted By: >>100828032

>>100827980
>2024-05-05: With commit 889bdd7 merged we now have BPE pre-tokenization for this model so I will be refreshing all the quants.
>https://huggingface.co/dranger003/c4ai-command-r-plus-iMat.GGUF
just get one done after this happened

Anonymous

Anonymous Wed 05 Jun 2024 20:06:29 No.100828010 Report

Quoted By:

>>100827909
That could also be bad RoPE settings, although I doubt it.

Anonymous

Anonymous Wed 05 Jun 2024 20:07:29 No.100828032 Report

Quoted By:

>>100827999
I see, thanks a lot anon
And what about other models? like dbrx or wiz, should I just look at the upload date?

Anonymous

View Same Google ImgOps iqdb SauceNAO P07DAB0pWXZqFQYMFZ6AM.png, 44KiB, 551x468

Anonymous Wed 05 Jun 2024 20:09:06 No.100828064 Report

Quoted By: >>100828083 >>100828160 >>100828161 >>100828237 >>100828319 >>100828442 >>100828459 >>100828539 >>100828609 >>100828892 >>100831181

Uh anons, you're not using l*ma models are you, you're not supporting CSAM right?
>https://huggingface.co/LWDCLS/LLM-Discussions/discussions/12#6660a87e65bfc6b43d7febdf

Anonymous

View Same Google ImgOps iqdb SauceNAO of course.png, 67KiB, 363x344

Anonymous Wed 05 Jun 2024 20:10:16 No.100828083 Report

Quoted By: >>100828102 >>100828626

>>100828064

Anonymous

Anonymous Wed 05 Jun 2024 20:11:22 No.100828102 Report

Quoted By:

>>100828083
he's just scared of competition

Anonymous

Anonymous Wed 05 Jun 2024 20:14:55 No.100828160 Report

Quoted By:

>>100828064
Literal discord troon.

Anonymous

Anonymous Wed 05 Jun 2024 20:15:01 No.100828161 Report

Quoted By:

>>100828064
Downloading ligma right now

Anonymous

Anonymous Wed 05 Jun 2024 20:19:48 No.100828237 Report

Quoted By: >>100828280 >>100828290

>>100828064
>text is child porn
Well fuck, pretty soon thoughts are going to be child porn. Think of the poor victims.

Anonymous

Anonymous Wed 05 Jun 2024 20:23:09 No.100828280 Report

Quoted By:

>>100828237
this is the modern le game
>dont you dare think of it
>what
>cp, oh youre going to jail now you filthy pedo

Anonymous

Anonymous Wed 05 Jun 2024 20:23:25 No.100828290 Report

Quoted By: >>100828446

>>100828237
>She had passionate sex with an 8 year old boy. It sent shivers down his spine.
There you go. Prime CP ITT.

Anonymous

View Same Google ImgOps iqdb SauceNAO wiz-c-cpp.png, 232KiB, 785x730

Anonymous Wed 05 Jun 2024 20:25:48 No.100828315 Report

Quoted By: >>100828465 >>100829323 >>100829384

>>100823856
>https://rocky-muscle-755.notion.site/What-happened-to-Wizard-LM2-a247e09244d0483cbb02c1587b357c9d
Whatever happened to the Wizard team is pretty suspicious with no official statements other than "Toxicity" testing and that it will be re-released soon.

To add another schizo theory to the mix, consider the following:
1) WizardLM-2 8x22B is a really intelligent model... like really good. If you are into C/C++ it was mogging everything else at the time of release (see attached pic). (side note: It is even plain old Apache 2.0 licensed and not some weird corporate lawyer modified one which is odd for such a high performing model).

2) >https://www.reddit.com/r/LocalLLaMA/comments/1cd4b9l/comment/l1b09p6/
>"...I have a connection with WizardLM Team and just heard some latest information from a member of Wizard, every researcher in this team is fine now, they are just preparing the release process now, and writing their coming paper. The new status is good, and they will come back soon. They told me that "Please do not worry about Wizard, don't be misled and just be patient now. Keep silent is the best way to protect them".

Hmm interesting: "Keep silent is the best way to protect them".

3) >https://www.cnbc.com/2024/05/16/microsoft-offers-relocation-to-hundreds-of-china-based-ai-staff-.html

Theory: CCP caught wind of some of its citizens who happened to be working for Microsoft Research releasing a SOTA model that beats GPT4 and probably started talking to some of the researchers. Microsoft realizing this team might just get coerced and scooped up by CCP removes everything they can about the team online with the hopes CCP loses interest after a while and they can get the team to relocate out of China.

Anonymous

Anonymous Wed 05 Jun 2024 20:26:15 No.100828319 Report

Quoted By:

>>100828064
won't someone think of the text based children? mods??? mods?????

Anonymous

Anonymous Wed 05 Jun 2024 20:35:02 No.100828442 Report

Quoted By: >>100828496 >>100828515 >>100828608

>>100828064
Limarp had child porn in it? That's disgusting if true but an accusation so serious cannot be made without proof. He should indicate precisely which logs contain such material.

Anonymous

Anonymous Wed 05 Jun 2024 20:35:30 No.100828446 Report

Quoted By:

>>100828290
Think of the fucking children. Wait shit, I mean fucking think of the children.

Anonymous

Anonymous Wed 05 Jun 2024 20:36:29 No.100828459 Report

Quoted By:

>>100828064
>has more tokens dedicated to jailbreaking the original model
Claude 3 just needs a prefill, though. And everything is slop, even Opus, which is what people actually want to use, right?

Anonymous

Anonymous Wed 05 Jun 2024 20:37:02 No.100828465 Report

Quoted By:

>>100828315
>This entire post
Hell of a story if true, even if it's just 25% correct.
>Beats just about anything at C/++, even Turbo
You#d think that GPT4 is better than this, but even L3 70b gets up it's ass at it someho

Anonymous

Anonymous Wed 05 Jun 2024 20:39:13 No.100828496 Report

Quoted By:

>>100828442
It has some clearly labeled roleplay threads from All The Fallen and Lolicit (rip), that's it.

Anonymous

Anonymous Wed 05 Jun 2024 20:39:52 No.100828504 Report

Quoted By:

>>100826974
I think I should post it. Should I post it chat?

Anonymous

Anonymous Wed 05 Jun 2024 20:40:41 No.100828515 Report

Quoted By:

>>100828442
I'm pretty sure he thinks fanfic is csam. But, to be perfectly honest, it's hard to believe not even one of these fanfics aren't based on reality.

Anonymous

Anonymous Wed 05 Jun 2024 20:43:09 No.100828539 Report

Quoted By:

>>100828064
Based. Pedos deserve the rope.

Anonymous

Anonymous Wed 05 Jun 2024 20:44:17 No.100828553 Report

Quoted By: >>100828600

>>100826974
Is this AI generated? This pic looks weird af

Anonymous

Anonymous Wed 05 Jun 2024 20:48:07 No.100828600 Report

Quoted By:

>>100828553
>Posted: 2010-01-29 22:32:54
99% sure it wasn't.
https://safebooru.org/index.php?page=post&s=view&id=26530

Anonymous

Anonymous Wed 05 Jun 2024 20:48:44 No.100828608 Report

Quoted By:

>>100828442
All foundation models include classic literature, which means they all include the full text of Lolita, which means they all include "child porn." The only way out is by using models with fully synthetic datasets like Phi, but those might contain some as well.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1707275492475727.png, 51KiB, 1090x324

Anonymous Wed 05 Jun 2024 20:48:49 No.100828609 Report

Quoted By: >>100828640 >>100828714 >>100828798

>>100828064
perfect solution for this should be picrel, in any case.

Anonymous

Anonymous Wed 05 Jun 2024 20:50:46 No.100828626 Report

Quoted By: >>100828646

>>100828083
That's just m*xxie AKA devnull.
https://desuarchive.org/g/search/text/fizzarolli

Anonymous

Anonymous Wed 05 Jun 2024 20:51:43 No.100828640 Report

Quoted By: >>100828683 >>100828848

>>100828609
Uh huh, and when you filter out all photographs of astronaughts riding horses on the moon from the dataset, Stable Diffusion becomes incapable of generating such images, right?

There's literally no point to this bullshit apart from virtue signalling.

Anonymous

Anonymous Wed 05 Jun 2024 20:52:02 No.100828646 Report

Quoted By:

>>100828626
Nah.

Anonymous

Anonymous Wed 05 Jun 2024 20:54:22 No.100828683 Report

Quoted By: >>100828719

>>100828640
the main thing is that the model itself does not know such concept which is good, what happens next - depends on the user, and his will to have some big problems with authorities.

Anonymous

Anonymous Wed 05 Jun 2024 20:57:03 No.100828714 Report

Quoted By: >>100829836

>>100828609
Limarp *intentionally* included explicit content involved underage characters, it's not there by accident, kek.
But it's not like you can't remove it from the data if you don't want it, it's not hidden (although model creators should mention this if they do).

Anonymous

Anonymous Wed 05 Jun 2024 20:57:36 No.100828719 Report

Quoted By:

>>100828683
The model itself already needs to constantly have its hand held if you want anything worthwhile. Literally nothing changes.

Anonymous

Anonymous Wed 05 Jun 2024 21:02:42 No.100828798 Report

Quoted By: >>100828843

>>100828609
is this reddit now?

Anonymous

View Same Google ImgOps iqdb SauceNAO 1698913377626906.png, 62KiB, 768x581

Anonymous Wed 05 Jun 2024 21:05:41 No.100828843 Report

Quoted By:

>>100828798
isn't that what you want?

Anonymous

Anonymous Wed 05 Jun 2024 21:06:09 No.100828848 Report

Quoted By:

>>100828640
According to normies, yes. They literally do not understand that the training data is not in the model. They actually think the gigabytes of weights somehow contain exact copies of the thousands of terabytes of training data.

Anonymous

View Same Google ImgOps iqdb SauceNAO kcpp 1.67 rocm.png, 4KiB, 232x205

Anonymous Wed 05 Jun 2024 21:10:15 No.100828884 Report

Quoted By: >>100829011

holy betatesting hours

Anonymous

View Same Google ImgOps iqdb SauceNAO nicetry.jpg, 910KiB, 2553x981

Anonymous Wed 05 Jun 2024 21:10:42 No.100828892 Report

Quoted By: >>100828930

>>100828064
>Oh no, that dataset has ""CSAM"" in it? How terrible, use this other dataset with ""CSAM"" in it instead!
Why are redditors like this?

Anonymous

Anonymous Wed 05 Jun 2024 21:14:22 No.100828930 Report

Quoted By: >>100829226

>>100828892
Disgusting. Thank you for the information, I will relay it to the FBI asap so they will send the piece of shit that made this dataset to jail.

Anonymous

Anonymous Wed 05 Jun 2024 21:23:14 No.100829011 Report

Quoted By:

>>100828884
Welcome to your average AMD/ROC experience

Anonymous

Anonymous Wed 05 Jun 2024 21:41:34 No.100829226 Report

Quoted By: >>100829588

>>100828930
le epic reddit delimma:

Big red button #1: Ban CP!
Big red button #2: Drag queen reading hour!

Which one??? Which one?!?!?

Anonymous

Anonymous Wed 05 Jun 2024 21:48:01 No.100829323 Report

Quoted By:

>>100828315
What the fuck.

Anonymous

Anonymous Wed 05 Jun 2024 21:52:30 No.100829384 Report

Quoted By:

>>100828315
>Theory:
That sounds like some schizo ass shit, but weirder things happened so I won't dismiss your interpretation of the events.
But it is an interesting situation for sure.
A shame too, those guys make some good shit. I really wish they gotten their hands on llama 3.

Anonymous

Anonymous Wed 05 Jun 2024 22:08:33 No.100829583 Report

Quoted By: >>100829660 >>100829870

this general is dead because most of you also post in aicg
sad state of affairs

Anonymous

Anonymous Wed 05 Jun 2024 22:08:48 No.100829588 Report

Quoted By:

>>100829226
??????????????????????????????????
What the fuck is your point even? Reddit debates CP vs Drag queen hour? Or that you're upset they'll ban CP, but not trannies reading books? Im so confused what you're even upset about

Anonymous

Anonymous Wed 05 Jun 2024 22:12:10 No.100829636 Report

Quoted By: >>100829744

I love cumming inside of Miqu using Locally hosted AI!

Anonymous

Anonymous Wed 05 Jun 2024 22:14:00 No.100829660 Report

Quoted By:

>>100829583
Owari da...

Anonymous

Anonymous Wed 05 Jun 2024 22:20:27 No.100829744 Report

Quoted By: >>100829763

>>100829636
the only way to not be a cuck
erp with remote models is like dating a prostitute

Anonymous

Anonymous Wed 05 Jun 2024 22:21:49 No.100829763 Report

Quoted By: >>100829838 >>100829862 >>100829953

>>100829744
When you date a prostitute, at least she's only with you at that moment. Remote model, she's not your gal, she's everyone's pal.

Anonymous

Anonymous Wed 05 Jun 2024 22:27:21 No.100829836 Report

Quoted By:

>>100828714
Whatever it includes, it repeats itself badly.

Anonymous

Anonymous Wed 05 Jun 2024 22:27:37 No.100829838 Report

Quoted By: >>100829932

>>100829763
When you talk to a character it's yours in the moment too, since your version of her isn't talking to anyone else. So technically remote stuff is mode "yours" than your average prostitute.
Then you tell the LLM to reset the chat, starting all over.

Anonymous

Anonymous Wed 05 Jun 2024 22:30:06 No.100829862 Report

Quoted By:

>>100829763
Local models aren't that much better, since your waifu is just a sloppy clone, not better than buying a generic onahole.
If you want a waifu that is truly yours, you will need to make your own personal fine-tune.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1713194174043881.jpg, 13KiB, 275x183

Anonymous Wed 05 Jun 2024 22:30:06 No.100829863 Report

Quoted By: >>100829875 >>100829884 >>100829928 >>100829994 >>100830002 >>100830049 >>100830152

Now that the dust has settled, what's the verdict on Llama 3?

Anonymous

Anonymous Wed 05 Jun 2024 22:30:56 No.100829870 Report

Quoted By: >>100830167

>>100829583
It's because the blacked miku anon was able to ban the normal miku anons along with him, but they don't use a proxy to ban evade.

Anonymous

Anonymous Wed 05 Jun 2024 22:31:10 No.100829875 Report

Quoted By:

>>100829863
shit.

Anonymous

Anonymous Wed 05 Jun 2024 22:31:55 No.100829884 Report

Quoted By:

>>100829863
Been shit on release, still shit today. Stop asking about it

Anonymous

Anonymous Wed 05 Jun 2024 22:35:34 No.100829928 Report

Quoted By: >>100829949

>>100829863
We will be so back once the extended context models are released

Anonymous

Anonymous Wed 05 Jun 2024 22:35:49 No.100829932 Report

Quoted By: >>100830153

>>100829838
>Then you tell the LLM to reset the chat, starting all over.
I wish.
>tfw RP is fire and then all of a sudden she starts to make less sense
>oh no it's context limit dementia
>>>(I think you forgot that your character is...)
>>>(Remember that when our characters met we talked about...)
>it so happily apologizes for the mistake and corrects it and then continues writing from your character's perspective instead of its own
>the tard wrangling only makes things worse in the long run
>it just starts repeating what you say because it now remembers only your corrections
>it's just a burning memory
>press Ctrl D and sigh

Anonymous

Anonymous Wed 05 Jun 2024 22:37:08 No.100829949 Report

Quoted By: >>100829954 >>100829996 >>100830015

>>100829928
We will be back once NovelAI releases their finetune.

Anonymous

Anonymous Wed 05 Jun 2024 22:37:36 No.100829953 Report

Quoted By:

>>100829763
pyramids by frank ocean is a song about switching your waifu card over to claude; the middle perspective is dario

Anonymous

Anonymous Wed 05 Jun 2024 22:37:48 No.100829954 Report

Quoted By: >>100830081

>>100829949
We will be back once everyone has 48GB of VRAM

Anonymous

Anonymous Wed 05 Jun 2024 22:41:05 No.100829994 Report

Quoted By:

>>100829863
I had some fun with it (70b) even though it's a long ass turn around time (12G vramlet here).
I haven't done much with other models but whatever I've tried seemed dumber so I quit bothering with older shit.
Context limit is tiny on it because ? and from what I've seen attempts to increase the context make it have brain damage.

Anonymous

Anonymous Wed 05 Jun 2024 22:41:12 No.100829996 Report

Quoted By: >>100830025 >>100830045 >>100830125

>>100829949
Bitnet is coming

Anonymous

Anonymous Wed 05 Jun 2024 22:41:44 No.100830002 Report

Quoted By: >>100830153

>>100829863
It's been pretty good. Basically the standard for 8B and 70B (dense). Just needs more context. As far as RP goes though, idk, I don't use it for RP.

Anonymous

Anonymous Wed 05 Jun 2024 22:42:27 No.100830015 Report

Quoted By:

>>100829949
>he doesn't know

Anonymous

View Same Google ImgOps iqdb SauceNAO 1689957414234047.png, 24KiB, 772x1124

Anonymous Wed 05 Jun 2024 22:43:25 No.100830025 Report

Quoted By: >>100830181

>>100829996

Anonymous

Anonymous Wed 05 Jun 2024 22:45:16 No.100830045 Report

Quoted By:

>>100829996
Where's the anon that was slowly distilling mistral 7B (I think?) into a bit net like configuration?
That was a really god damn cool idea.

Anonymous

Anonymous Wed 05 Jun 2024 22:45:39 No.100830049 Report

Quoted By: >>100830113

>>100829863
pretty clever, low context kinda sucks. If you do RP then not for promptlets. WizardLM 8x22 is IMO slightly better, but same rules apply. Some people like Command R+ more for RP, but compared to the other two its kinda dumb. We're in a really good place with these in my opinion. People keep moving goalposts but compared to what we had these models are all amazing.

Anonymous

View Same Google ImgOps iqdb SauceNAO chrome_movNkLYKBg.png, 1MiB, 1911x1432

Anonymous Wed 05 Jun 2024 22:48:47 No.100830081 Report

Quoted By: >>100830137 >>100830141 >>100830212

>>100829954
I don't know dude, I'm using Pony SD, Alltalk and L3 8bpw all on a single 3090 and there is still 3 GBs of VRAM available.

Anonymous

Anonymous Wed 05 Jun 2024 22:51:56 No.100830113 Report

Quoted By: >>100830168

>>100830049
as someone who used both a lot i think wizard and cr+ are more even or sidegrades
wizard is definitely faster, and a bit smarter for the 'assistant' purposes, but i found cr+ to be way more knowledgeable and less censored, that being on top of superior rp
although, i believe cr+ is also better for working with documents and analyzing text for obvious purposes
defo keeping both doe until something ubermogs them in their niches

Anonymous

View Same Google ImgOps iqdb SauceNAO believe.png, 865KiB, 950x998

Anonymous Wed 05 Jun 2024 22:53:03 No.100830125 Report

Quoted By:

>>100829996

Anonymous

View Same Google ImgOps iqdb SauceNAO Clipboard.jpg, 150KiB, 1908x727

Anonymous Wed 05 Jun 2024 22:53:41 No.100830137 Report

Quoted By:

>>100830081
It's even trying to link an image

Anonymous

Anonymous Wed 05 Jun 2024 22:53:49 No.100830141 Report

Quoted By:

>>100830081
>8B

Anonymous

Anonymous Wed 05 Jun 2024 22:55:18 No.100830152 Report

Quoted By:

>>100829863
Still using Command-R(+).

Anonymous

Anonymous Wed 05 Jun 2024 22:55:28 No.100830153 Report

Quoted By: >>100830174

>>100830002
>As far as RP goes though, idk, I don't use it for RP.
My experience:
>Creativity isn't bad but not outstanding. Sometimes it comes up with something really awesome out of nowhere.
>Censorship is paper thin. Just tell it how you want it to dodge the censorship and it does.
>Context limit really hurts. You don't notice it till things get good and then >>100829932
>But it does accept stage directions quite well so even when it does something stupid you can usually put it back on track in one or two turns.
>Can get repetitive if you are lazy about being dynamic and varied with it.
>When it's being really boring I'll just yell at it to quit fucking around and do something interesting, and it does.

Anonymous

Anonymous Wed 05 Jun 2024 22:56:38 No.100830167 Report

Quoted By: >>100830179

>>100829870
>normal miku anons
You mean you can't spam pictures that add nothing to the thread? Oh no.

Anonymous

Anonymous Wed 05 Jun 2024 22:56:55 No.100830168 Report

Quoted By:

>>100830113 (me)
>purposes
*reasons

Anonymous

Anonymous Wed 05 Jun 2024 22:57:25 No.100830174 Report

Quoted By: >>100830439

>>100830153
Huh, what do you mean by stage directions? I haven't heard of this technique yet.

Anonymous

Anonymous Wed 05 Jun 2024 22:58:12 No.100830179 Report

Quoted By: >>100830205

>>100830167
ive been helped more by a mikuposter than a kurisufag/blackedposter
curious

Anonymous

Anonymous Wed 05 Jun 2024 22:58:28 No.100830181 Report

Quoted By:

>>100830025
kek

Anonymous

View Same Google ImgOps iqdb SauceNAO 1693312107091115.jpg, 64KiB, 541x527

Anonymous Wed 05 Jun 2024 22:59:53 No.100830203 Report

Quoted By: >>100830215 >>100830509

I'm posting some old images.

Anonymous

Anonymous Wed 05 Jun 2024 23:00:08 No.100830205 Report

Quoted By: >>100830239 >>100830270

>>100830179
mikuposters are trannies, and as all trannies they love the attention you give them. True channers wouldn't help newfags like you.

Anonymous

Anonymous Wed 05 Jun 2024 23:00:28 No.100830212 Report

Quoted By: >>100830225

>>100830081
>8B
Bruh.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1693076254924514.png, 18KiB, 310x351

Anonymous Wed 05 Jun 2024 23:00:54 No.100830215 Report

Quoted By: >>100830222

>>100830203

Anonymous

Anonymous Wed 05 Jun 2024 23:01:54 No.100830219 Report

Quoted By: >>100830244

Anyone experiment with running multiple models for the same RP. I been having a lot of success running storywriter until it goes schizo and then do a few prompts with midnight miqu to get it back on track.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1693077714337853.png, 166KiB, 1024x1024

Anonymous Wed 05 Jun 2024 23:02:29 No.100830222 Report

Quoted By: >>100830234

>>100830215

Anonymous

View Same Google ImgOps iqdb SauceNAO 00234-429714122.png, 498KiB, 512x768

Anonymous Wed 05 Jun 2024 23:02:48 No.100830225 Report

Quoted By:

>100830179
>>100830212
it all works is what matters

Anonymous

Anonymous Wed 05 Jun 2024 23:03:03 No.100830230 Report

Quoted By: >>100830246

A prompting technique I like to use is make the LLM fill out a "form" before each reply in which it writes from the perspective of various "experts" (from authors, to game masters, to 4chan generals, to Joker who comes up with a random idea) what should be included in the reply and which direction the story should be going. This gets surrounded by a specifically formatted block which gets automatically hidden (and later deleted) by a regex on output. Then I let the model write the actual reply, based on the "expert input". It works really well to get very varied responses.

I also have a slight modification of this prompting where I let the model first write a draft, and then have the experts critique it and add their own ideas before writing the final reply, taking these into account. Both ideas work very well for GM-based roleplaying and also makes the replies smarter. Downside is that it's relatively slow and needs a model that's smart enough to understand such instructions to begin with. Can only recommend. If you are bored with predictable outputs, get more creative with the prompting.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1693227711470409.png, 109KiB, 410x482

Anonymous Wed 05 Jun 2024 23:03:30 No.100830234 Report

Quoted By: >>100830243

>>100830222

Anonymous

View Same Google ImgOps iqdb SauceNAO 1717628599108.png, 60KiB, 256x256

Anonymous Wed 05 Jun 2024 23:03:48 No.100830239 Report

Quoted By: >>100830251 >>100830263

>>100830205
You're trying too hard to fit in buddy

Anonymous

View Same Google ImgOps iqdb SauceNAO 1694362339089590.webm, 122KiB, 640x480

Anonymous Wed 05 Jun 2024 23:04:31 No.100830243 Report

Quoted By: >>100830254

>>100830234

Anonymous

Anonymous Wed 05 Jun 2024 23:04:33 No.100830244 Report

Quoted By:

>>100830219
>I get it on track with a random meme merge

Anonymous

Anonymous Wed 05 Jun 2024 23:04:43 No.100830246 Report

Quoted By: >>100830413

>>100830230
Examples of the form and the response? Is it just a part of the response of the character?

Anonymous

View Same Google ImgOps iqdb SauceNAO 1717158652535500.png, 1MiB, 904x904

Anonymous Wed 05 Jun 2024 23:05:08 No.100830251 Report

Quoted By: >>100830269

>>100830239

Anonymous

View Same Google ImgOps iqdb SauceNAO 1591401786151.jpg, 36KiB, 750x731

Anonymous Wed 05 Jun 2024 23:05:32 No.100830254 Report

Quoted By: >>100830267

>>100830243

Anonymous

View Same Google ImgOps iqdb SauceNAO 00251-1188643959.png, 610KiB, 600x896

Anonymous Wed 05 Jun 2024 23:06:20 No.100830263 Report

Quoted By:

>>100830239

Anonymous

View Same Google ImgOps iqdb SauceNAO 1691951561467276.png, 128KiB, 1726x491

Anonymous Wed 05 Jun 2024 23:06:32 No.100830267 Report

Quoted By: >>100830283

>>100830254

Anonymous

Anonymous Wed 05 Jun 2024 23:06:36 No.100830269 Report

Quoted By: >>100830293

>>100830251
ANIME WEBSITE NEWFAG TRANNY NIGGER FAGGOT

Anonymous

View Same Google ImgOps iqdb SauceNAO 1708220037217743.png, 41KiB, 1359x571

Anonymous Wed 05 Jun 2024 23:06:42 No.100830270 Report

Quoted By: >>100830312 >>100830363

>>100830205
>turns out anime avatarfags are trannies
whew what a revelation!

Anonymous

View Same Google ImgOps iqdb SauceNAO 1698867703359005.png, 85KiB, 500x544

Anonymous Wed 05 Jun 2024 23:07:33 No.100830283 Report

Quoted By: >>100830295 >>100830381

>>100830267

Anonymous

Anonymous Wed 05 Jun 2024 23:08:32 No.100830293 Report

Quoted By:

>>100830269
you reap what you sow anon

Anonymous

View Same Google ImgOps iqdb SauceNAO 1699826941069579.png, 80KiB, 589x600

Anonymous Wed 05 Jun 2024 23:08:34 No.100830295 Report

Quoted By: >>100830308 >>100830309

>>100830283

Anonymous

View Same Google ImgOps iqdb SauceNAO 1699815453060556.png, 4KiB, 739x43

Anonymous Wed 05 Jun 2024 23:09:34 No.100830308 Report

Quoted By: >>100830328

>>100830295

Anonymous

Anonymous Wed 05 Jun 2024 23:09:37 No.100830309 Report

Quoted By: >>100830344

>>100830295
Hey I just realized people finally stopped pushing frankenmerges ITT. I guess this thread can get better in some ways.

Anonymous

Anonymous Wed 05 Jun 2024 23:09:43 No.100830312 Report

Quoted By: >>100830324

>>100830270
Now someone post all the RECENT times when people got banned for talking shit about anime poster on this anime website based on a japanese anime website.

Anonymous

Anonymous Wed 05 Jun 2024 23:10:24 No.100830324 Report

Quoted By:

>>100830312
>anime anime anime anime anime
fix your rep. pen faget

Anonymous

View Same Google ImgOps iqdb SauceNAO 1700489660136616.jpg, 148KiB, 1024x1024

Anonymous Wed 05 Jun 2024 23:10:35 No.100830328 Report

Quoted By: >>100830344

>>100830308

Anonymous

View Same Google ImgOps iqdb SauceNAO 1702475830061.jpg, 148KiB, 1024x1024

Anonymous Wed 05 Jun 2024 23:11:17 No.100830334 Report

Quoted By: >>100830346 >>100830371

anime and migu is cute
any objections?

Anonymous

View Same Google ImgOps iqdb SauceNAO 1702534818361254.jpg, 3MiB, 4728x6000

Anonymous Wed 05 Jun 2024 23:11:53 No.100830344 Report

Quoted By: >>100830353

>>100830328

>>100830309
Careful, let's not jinx it.

Anonymous

Anonymous Wed 05 Jun 2024 23:12:10 No.100830346 Report

Quoted By: >>100830356 >>100830364

>>100830334
anime = gay.
every anime enjoyer is a gay pedophile. fact.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1712708599662351.webm, 217KiB, 886x692

Anonymous Wed 05 Jun 2024 23:12:56 No.100830353 Report

Quoted By: >>100830367

>>100830344

Anonymous

View Same Google ImgOps iqdb SauceNAO 1696630904565971.png, 571KiB, 896x1024

Anonymous Wed 05 Jun 2024 23:13:03 No.100830356 Report

Quoted By: >>100830395 >>100830436

>>100830346

Anonymous

Anonymous Wed 05 Jun 2024 23:13:40 No.100830363 Report

Quoted By:

>>100830270
I used to joke about mikutrannies being trannies but melting down over picture in OP made me start to wonder. And then previous thread where one of them said he is leaving for /aicg/, only to keep posting and trying to convince me I want him to stay, made it 100% clear that those are actual troons with some hormonal issues.

Anonymous

Anonymous Wed 05 Jun 2024 23:13:41 No.100830364 Report

Quoted By:

>>100830346
(and that's a good thing)

Anonymous

View Same Google ImgOps iqdb SauceNAO 1691430290964.gif, 3MiB, 498x402

Anonymous Wed 05 Jun 2024 23:13:59 No.100830367 Report

Quoted By: >>100830384

>>100830353

Anonymous

Anonymous Wed 05 Jun 2024 23:14:17 No.100830371 Report

Quoted By: >>100830385 >>100830390

>>100830334
why anime attracts so many freaks then? the ones spamming both generic stuff and its blacked version right now

Anonymous

Anonymous Wed 05 Jun 2024 23:15:15 No.100830381 Report

Quoted By: >>100830449

>>100830283
Hey that's my Migu :D

Anonymous

View Same Google ImgOps iqdb SauceNAO 1693056906454530.png, 223KiB, 1280x720

Anonymous Wed 05 Jun 2024 23:15:32 No.100830384 Report

Quoted By: >>100830400

>>100830367

Anonymous

Anonymous Wed 05 Jun 2024 23:15:34 No.100830385 Report

Quoted By:

>>100830371
>blacked version
It is my cudgel. And I make sure to wash my hands 2 times whenever I stop using it.

Anonymous

Anonymous Wed 05 Jun 2024 23:16:02 No.100830390 Report

Quoted By: >>100830409

>>100830371
are the 'normal people' in the thread with us right now?

Anonymous

Anonymous Wed 05 Jun 2024 23:16:38 No.100830395 Report

Quoted By:

>>100830356
They need to stop measuring the rapes. That's the way to drop the rape statistics.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1695839679146815.png, 3KiB, 608x105

Anonymous Wed 05 Jun 2024 23:17:05 No.100830400 Report

Quoted By: >>100830415 >>100830421

>>100830384

Anonymous

Anonymous Wed 05 Jun 2024 23:17:38 No.100830409 Report

Quoted By: >>100830418 >>100830430

>>100830390
They usually drop a useful post and leave. Except now they won't do that because if they do that, a mikuposter will hunt down their personal details.

Anonymous

Anonymous Wed 05 Jun 2024 23:17:59 No.100830413 Report

Quoted By:

>>100830246
yes, basically the secret is to make the model generate a bunch of tokens it can bounce the reply off of. Remember, auto-regressive networks don't really care about who wrote what and every token in the context will influence the tokens coming after it. The query for the form is included as very last message, even after my last output, that way I make sure it gets generated each time. It's basically CoT, and if you google CoTs for LLMs, you should get examples. (can't post any right now, not at home) Thing with CoT is that the intended goal is to get the most accurate reply, if you are not looking for that, you can go wild with the experts you pick. In Sillytavern you could make this even more complex by randomly switching experts per reply.

Sometimes models fuck up the formatting and the thinking block can't get deleted by the regex, with the current crop of models this is rare, though.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1679509733486.jpg, 117KiB, 1488x448

Anonymous Wed 05 Jun 2024 23:18:05 No.100830415 Report

Quoted By: >>100830449

>>100830400

Anonymous

Anonymous Wed 05 Jun 2024 23:18:19 No.100830418 Report

Quoted By:

>>100830409
literal tranny behavior, ngl

Anonymous

Anonymous Wed 05 Jun 2024 23:18:40 No.100830421 Report

Quoted By: >>100830449

>>100830400
Best answer I have ever seen but it also looks like an ancient 3B.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1713852148258748.gif, 259KiB, 270x200

Anonymous Wed 05 Jun 2024 23:19:52 No.100830430 Report

Quoted By: >>100830450

>>100830409

Anonymous

Anonymous Wed 05 Jun 2024 23:20:23 No.100830436 Report

Quoted By: >>100830460

>>100830356
so what you're saying is in japan is rape happens less, but child rape is 20-30x more likely than adult rape compared to the western world where it's 5-10x more likely? seems like less anime = less kids raped. imagine if the uk were all fucking weebs. kids would be getting raped at astronomical levels.

Anonymous

Anonymous Wed 05 Jun 2024 23:20:34 No.100830439 Report

Quoted By: >>100830454 >>100830481 >>100830504 >>100830582

>>100830174
In addition to dialog and description of action I add parenthetical instructions to guide the AI into generating information that I don't want to specify myself but that does need to be established.
Example
>>> (We skip ahead to the time of the date, Saturday at 6pm.) I pull up to your place and I learn the conditions in which you live. (Explain in detail the conditions of your domicile so I can imagine what it looks like.)
The moment of truth has arrived! As you pull up to 123 Oak Street, you're greeted by a cozy little bungalow with a warm, inviting aura. The exterior walls are painted a soft, creamy yellow, and the roof is a deep, rusty red. A cheerful white picket fence surrounds the property, adorned with colorful flower boxes bursting with blooming petunias and geraniums.

I also do this for things like
>(Don't forget that you are in the living room and I'm in the kitchen.)
which can prevent it from making spatial and continuity errors especially when context is filling up or muddy with a lot of movement, or
>(This presents you an opportunity to look around the place, or to follow me into the next room, as you see fit.)
lets me prime it with a number of options that won't be either dead ass boring go-along-with-whatever nop filler but also won't open the door for it to do random wtf shit. (I let it do what it wanted once and walked away to start its own story all about itself without me. rp rip lol)

I pull out the director's megaphone and bitch it out if it stalls or goes passive:
>(This is boring! Stop implying that something interesting is about to happen. Decide on something to do and do it!)

I have no idea if text being plain, in quotes, or in parentheses actually matters, but I haven't noticed any significant mixing up of (directives), character action, and "character speech" in its responses or reactions to my prompts that use this style so it isn't a problem at least and helps me to keep the text straight no matter if it helps L3 70b.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1699045472177921.gif, 55KiB, 388x440

Anonymous Wed 05 Jun 2024 23:21:28 No.100830449 Report

Quoted By: >>100830463

>>100830415

>>100830381
:)

>>100830421
Probably larger. This one was only from September of last year actually.

Anonymous

Anonymous Wed 05 Jun 2024 23:21:31 No.100830450 Report

Quoted By:

>>100830430
Does someone have the original video? I kinda want to rewatch it

Anonymous

Anonymous Wed 05 Jun 2024 23:22:01 No.100830454 Report

Quoted By: >>100830567

>>100830439
Unprecedented levels of neckbeard playing with AI dolls.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1694103833388187.png, 1MiB, 1592x1888

Anonymous Wed 05 Jun 2024 23:22:42 No.100830460 Report

Quoted By:

>>100830436

Anonymous

View Same Google ImgOps iqdb SauceNAO GPT-4 Technical Report 2303.0877 (...).jpg, 134KiB, 1161x1280

Anonymous Wed 05 Jun 2024 23:22:49 No.100830463 Report

Quoted By: >>100830476

>>100830449

Anonymous

View Same Google ImgOps iqdb SauceNAO 1704754572923640.webm, 462KiB, 1920x1080

Anonymous Wed 05 Jun 2024 23:23:50 No.100830476 Report

Quoted By: >>100830491

>>100830463

Anonymous

Anonymous Wed 05 Jun 2024 23:24:12 No.100830481 Report

Quoted By: >>100830504 >>100830567

>>100830439
>Anon: (Tell me you love me very much and you want to have my babies)
>L3 400B: I love you and I want to have your babies
>Anon:(Now suck my cock hard. Focus on the tip)
>L3 400B: I suck your cock hard. I focus on the tip.

OH MY GOD IT IS WORKING!

Anonymous

View Same Google ImgOps iqdb SauceNAO 1683087705225247.png, 1MiB, 1939x1258

Anonymous Wed 05 Jun 2024 23:24:54 No.100830491 Report

Quoted By: >>100830529

>>100830476

Anonymous

Anonymous Wed 05 Jun 2024 23:25:54 No.100830504 Report

Quoted By:

>>100830439
>>100830481
Interesting.
Anon are you aware that you are now a certified prompt engineer? I kneel.

Anonymous

Anonymous Wed 05 Jun 2024 23:26:31 No.100830509 Report

Quoted By: >>100830529

>>100830203
>>...
Thank you Cleansing Anon

Anonymous

View Same Google ImgOps iqdb SauceNAO 1687962357090223.gif, 3MiB, 498x270

Anonymous Wed 05 Jun 2024 23:26:49 No.100830513 Report

Quoted By:

Your model: "barely above a whisper"
Mine: "I whisper back, my voice just loud enough to be heard over the gentle rustle of the wind outside our window."

Anonymous

View Same Google ImgOps iqdb SauceNAO 1682036297920315.png, 70KiB, 730x1045

Anonymous Wed 05 Jun 2024 23:28:21 No.100830529 Report

Quoted By: >>100830551 >>100830552

>>100830491

>>100830509
?
If you mean as a palette cleanser from the shitposting, it wasn't really my intention. I just began sorting my image folders so I thought I'd do a dump while I'm at it, to relive the memories.

Anonymous

Anonymous Wed 05 Jun 2024 23:28:37 No.100830532 Report

Quoted By: >>100830547 >>100830565 >>100830588

I'm baking in 15 minutes.

Anonymous

Anonymous Wed 05 Jun 2024 23:29:41 No.100830547 Report

Quoted By: >>100830556

>>100830532
>page 1
Thread is here splitnigger:

>>100824058
>>100824058
>>100824058

Anonymous

Anonymous Wed 05 Jun 2024 23:30:12 No.100830551 Report

Quoted By:

>>100830529
I like these memories

Anonymous

View Same Google ImgOps iqdb SauceNAO 1682110251790.webm, 4MiB, 1280x720

Anonymous Wed 05 Jun 2024 23:30:17 No.100830552 Report

Quoted By: >>100830579 >>100830581

>>100830529

Anonymous

Anonymous Wed 05 Jun 2024 23:30:32 No.100830556 Report

Quoted By: >>100830563 >>100830612

>>100830547
That's the troll thread. And I said 15 minutes.

Anonymous

Anonymous Wed 05 Jun 2024 23:31:21 No.100830563 Report

Quoted By:

>>100830556
i don't see trolling in there

Anonymous

Anonymous Wed 05 Jun 2024 23:31:24 No.100830565 Report

Quoted By:

>>100830532
Don't forget the butter!

Anonymous

Anonymous Wed 05 Jun 2024 23:31:34 No.100830567 Report

Quoted By:

>>100830454
I don't even know what I'm doing with LLM.
But it's kinda fun.
Go have fun.

>>100830481
That works too if you're in a hurry.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1680313064680303.jpg, 93KiB, 715x404

Anonymous Wed 05 Jun 2024 23:33:04 No.100830579 Report

Quoted By: >>100830604

>>100830552

Anonymous

Anonymous Wed 05 Jun 2024 23:33:09 No.100830581 Report

Quoted By: >>100830653

>>100830552
Do the bugs just shrug off the ball things or is it simply a game of chance (avoid injury)?

Anonymous

Anonymous Wed 05 Jun 2024 23:33:10 No.100830582 Report

Quoted By: >>100830675

>>100830439
I like stage directions too, another way I do it which works quite well is to do something like "OOC: Please remember..." which works quite well. I also add a note in the system prompt indicating that I might provide OOC instructions in this format to guide the roleplay.

If the model spits crap in its response after several regens I'll add the OOC note at the end of my reply and most of the time the character's response will come out solid.

Anonymous

View Same Google ImgOps iqdb SauceNAO lmgqueen.jpg, 91KiB, 640x400

Anonymous Wed 05 Jun 2024 23:33:33 No.100830588 Report

Quoted By: >>100830627

>>100830532
I am gonna bake one in 15 minutes too!

Anonymous

View Same Google ImgOps iqdb SauceNAO Stanford Alpaca dataset 16802498 (...).png, 819KiB, 1990x1926

Anonymous Wed 05 Jun 2024 23:34:41 No.100830604 Report

Quoted By: >>100830653

>>100830579

Anonymous

Anonymous Wed 05 Jun 2024 23:35:22 No.100830612 Report

Quoted By:

>>100830556
That's the anti-troll thread, this thread is actually the troll thread.

Anonymous

View Same Google ImgOps iqdb SauceNAO q1h6mwgu9vz51.jpg, 402KiB, 854x1200

Anonymous Wed 05 Jun 2024 23:36:27 No.100830625 Report

Quoted By: >>100830635

Nah I am just kidding. Why should I wait 15 minutes when I can make one now!

>>100830615
>>100830615
>>100830615

Anonymous

Anonymous Wed 05 Jun 2024 23:36:31 No.100830627 Report

Quoted By: >>100830638 >>100830671

>>100830588
The real thread won't have a blacked miku pic nor the quant repair anon post. It will also have Miku and Teto as the OP pic. Fuck you.

Anonymous

Anonymous Wed 05 Jun 2024 23:37:05 No.100830635 Report

Quoted By:

>>100830625
chad move

Anonymous

Anonymous Wed 05 Jun 2024 23:37:28 No.100830638 Report

Quoted By:

>>100830627
Are you autistic? Genuinely asking.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1655688114205.jpg, 88KiB, 658x883

Anonymous Wed 05 Jun 2024 23:38:17 No.100830653 Report

Quoted By: >>100830684

>>100830604

>>100830581
It doesn't look like the balls were very sturdy so I imagine it's possible they weren't hurt much.

Anonymous

Anonymous Wed 05 Jun 2024 23:39:37 No.100830671 Report

Quoted By: >>100830681

>>100830627
>the quant repair anon post
Why? What is the issue with that post?

Anonymous

Anonymous Wed 05 Jun 2024 23:40:00 No.100830675 Report

Quoted By:

>>100830582
I haven't done much with System.
I'd just start 70b, give it two really big fat directives, the first being character outline (even if it's basically telling it to make up a character, works fine usually), the second being output guidance like not to pull any them-they shit talking about my character and that it is not to describe my character's actions but to pass the turn instead.
It seems to stick to these first-turns directives even when the rest of the memory is falling apart, I guess because they've guided the rest of what is still in context.

Anonymous

Anonymous Wed 05 Jun 2024 23:40:40 No.100830681 Report

Quoted By: >>100830701

>>100830671
The retard thinks quant repair anon is kurisu anon because the voices in his mind told him so.

Anonymous

View Same Google ImgOps iqdb SauceNAO 335423530_784020866412860_918655 (...).jpg, 113KiB, 500x787

Anonymous Wed 05 Jun 2024 23:40:54 No.100830684 Report

Quoted By:

>>100830653

Anonymous

Anonymous Wed 05 Jun 2024 23:42:21 No.100830701 Report

Quoted By: >>100830733

>>100830681
I am starting to think baking /lmg/ threads is like his purpose in life. Like he thinks we are his family and only reason to live. That would explain why he spergs out so much over mundane shit.

Anonymous

Anonymous Wed 05 Jun 2024 23:43:31 No.100830715 Report

Quoted By:

Remember when we had the based kamen rider baker? Good times

Anonymous

Anonymous Wed 05 Jun 2024 23:43:49 No.100830719 Report

Quoted By: >>100830728

you people are so autistic and pedantic you find the most retarded things to be angry and argue about. a thread is a thread. if you don't like shit you're seeing, DON'T LOOK AT IT.

Anonymous

Anonymous Wed 05 Jun 2024 23:44:52 No.100830728 Report

Quoted By:

>>100830719
Sir. This is reddit. We have mods for hurt feelings.

Anonymous

Anonymous Wed 05 Jun 2024 23:45:21 No.100830733 Report

Quoted By:

>>100830701
damn, he is literally me

Anonymous

Anonymous Wed 05 Jun 2024 23:45:55 No.100830738 Report

Quoted By: >>100830806 >>100830834

New Thread
>>100830736
>>100830736
>>100830736
Let's gooooooooo

Anonymous

Anonymous Wed 05 Jun 2024 23:47:39 No.100830761 Report

Quoted By: >>100830794

>page 2
holy shit /lmg/ has gone full retard recently

Anonymous

Anonymous Wed 05 Jun 2024 23:50:16 No.100830794 Report

Quoted By:

>>100830761
Local models are pretty much dead so everyone's trying to become a the next petra for attention. /lmg/ is essentially the katawa shoujo general of /g/ with how little proper news we're getting which is why multi-layer ironic shitposting is all that's left.

Anonymous

Anonymous Wed 05 Jun 2024 23:50:20 No.100830796 Report

Quoted By:

How do you do spoilers in Silly again?

Anonymous

Anonymous Wed 05 Jun 2024 23:51:09 No.100830806 Report

Quoted By:

>>100830738
That place is a sewer at this point...

Anonymous

Anonymous Wed 05 Jun 2024 23:51:43 No.100830811 Report

Quoted By: >>100830833 >>100830840

it might be time to retire /lmg/ until there's something to talk about

Anonymous

Anonymous Wed 05 Jun 2024 23:53:13 No.100830833 Report

Quoted By:

>>100830811
I agree, I will make a discord so we can talk temporarily and ban any trannies that try to shit the place

Anonymous

Anonymous Wed 05 Jun 2024 23:53:16 No.100830834 Report

Quoted By:

>>100830738
that thread is false flag by anti mikuposters. do not engage.

Anonymous

Anonymous Wed 05 Jun 2024 23:53:38 No.100830840 Report

Quoted By:

>>100830811
Let's join /vsg/ for a while.

Anonymous

Anonymous Wed 05 Jun 2024 23:55:03 No.100830853 Report

Quoted By: >>100830862 >>100830877

you can tell its summer again

Anonymous

Anonymous Wed 05 Jun 2024 23:55:43 No.100830862 Report

Quoted By:

>>100830853
It's one anon.

Anonymous

Anonymous Wed 05 Jun 2024 23:56:29 No.100830877 Report

Quoted By:

>>100830853
That's what you get when you spoonfeed all the children who come into this thread asking what to run on their shitty 3070s. You end up with retards who have nothing to do but to shitpost here because they can't run anything besides unusable 8B models.

Anonymous

Anonymous Wed 05 Jun 2024 23:56:50 No.100830883 Report

Quoted By: >>100830896 >>100830902 >>100830907 >>100830956 >>100830963 >>100830970 >>100830981 >>100831009 >>100831053 >>100833351

>Zero accuracy loss of INT4 model, even comparing with FP16 model.
Big if true.
https://x.com/HaihaoShen/status/1798328753250271704

Anonymous

View Same Google ImgOps iqdb SauceNAO cap.jpg, 322KiB, 1280x780

Anonymous Wed 05 Jun 2024 23:57:26 No.100830891 Report

Quoted By: >>100830929 >>100830977 >>100831018

Here's an example of the expert prompting technique outlined somewhere above, this was generated by Wizard LM 8x22. I actually think Sillytavern is not practical to do really complex prompting. We need something like ComfyUI for LLMs. Also yes, bird women. I am a victim of 80s cartoons, ok?

Anonymous

Anonymous Wed 05 Jun 2024 23:57:54 No.100830896 Report

Quoted By: >>100830924

>>100830883
>haihaoshen
nothingburger.

Anonymous

Anonymous Wed 05 Jun 2024 23:58:02 No.100830902 Report

Quoted By: >>100830916

>>100830883
>INT4 model, even comparing with FP16 model
bullshit until proven otherwise

Anonymous

Anonymous Wed 05 Jun 2024 23:58:22 No.100830907 Report

Quoted By:

>>100830883
>delete all layers except the ones used in benchmarks
>WOW!!! ZERO LOSS!!

Anonymous

Anonymous Wed 05 Jun 2024 23:59:00 No.100830916 Report

Quoted By:

>>100830902
here's the proof:

https://x.com/HaihaoShen/status/1798328753250271704

Anonymous

View Same Google ImgOps iqdb SauceNAO 1700065889220594.png, 12KiB, 634x102

Anonymous Wed 05 Jun 2024 23:59:26 No.100830924 Report

Quoted By: >>100830956

>>100830896
The int4 models are published by Intel though

Anonymous

Anonymous Wed 05 Jun 2024 23:59:52 No.100830929 Report

Quoted By: >>100830979

>>100830891
This is stupid, if you will just let LLM do everything for you you should just read a book

Anonymous

Anonymous Thu 06 Jun 2024 00:01:50 No.100830956 Report

Quoted By:

>>100830924
>>100830883
Now if only it was 2 bit. 4bit 400B still aint going to fit in 100GB.

Anonymous

Anonymous Thu 06 Jun 2024 00:02:32 No.100830963 Report

Quoted By:

>>100830883
That made me think. Is there a training method where you sort of do dropout on weight precision for quanting? Like you finished training the model and then you continue training but gradually start to reduce precision of random weights until you end up with some bpw or even static bit size?

Anonymous

Anonymous Thu 06 Jun 2024 00:02:58 No.100830970 Report

Quoted By: >>100830999

>>100830883
The non-meme question is, llama quants where? Or first I should ask what the fuck is that.

Anonymous

Anonymous Thu 06 Jun 2024 00:03:29 No.100830977 Report

Quoted By:

>>100830891
How did you do this?

Anonymous

Anonymous Thu 06 Jun 2024 00:03:46 No.100830979 Report

Quoted By:

>>100830929
What if the book you want to read hasn't been written?

Anonymous

Anonymous Thu 06 Jun 2024 00:03:50 No.100830981 Report

Quoted By: >>100831034

>>100830883
Bitnet bros, I don't feel so good...
Where is paper tho

Anonymous

Anonymous Thu 06 Jun 2024 00:05:38 No.100830999 Report

Quoted By: >>100831012 >>100831026

>>100830970
Come on. You know the answer. shit wont be reproducible. if all the shit that was presented like this turned out to be anything but a nothingburger, we'd have 1 billion token context models running on 6 gb cards. It always ends up being a wet fart. Field moves fast, but not that fast. Some rules cannot be broken.

Anonymous

View Same Google ImgOps iqdb SauceNAO 1695169763209758.png, 34KiB, 775x657

Anonymous Thu 06 Jun 2024 00:06:19 No.100831009 Report

Quoted By: >>100831059

>>100830883
no way this is real.
https://huggingface.co/Intel/SOLAR-10.7B-Instruct-v1.0-int4-inc

Anonymous

Anonymous Thu 06 Jun 2024 00:06:34 No.100831012 Report

Quoted By:

>>100830999
Yeah, it's so over, I guess we should go to /caig/...

Anonymous

Anonymous Thu 06 Jun 2024 00:06:50 No.100831016 Report

Quoted By: >>100831082 >>100831093

Can I make a thread too?

Anonymous

Anonymous Thu 06 Jun 2024 00:06:56 No.100831018 Report

Quoted By:

>>100830891
kino

Anonymous

View Same Google ImgOps iqdb SauceNAO 1686361512026701.png, 56KiB, 743x791

Anonymous Thu 06 Jun 2024 00:07:26 No.100831026 Report

Quoted By: >>100831056 >>100831059 >>100831079

>>100830999
>You know the answer. shit wont be reproducible
They say how to reproduce it as the first thing in the repo

Anonymous

Anonymous Thu 06 Jun 2024 00:08:11 No.100831034 Report

Quoted By: >>100831104 >>100831281

>>100830981
Paper is here: https://arxiv.org/abs/2309.05516

Anonymous

Anonymous Thu 06 Jun 2024 00:09:03 No.100831044 Report

Quoted By: >>100831068

>something big about lossless quantization drops
>suddenly /lmg/ become unusable due to thread spam
Damn, Sam Altman must be really scared about this one to have his bots try this hard.

Anonymous

Anonymous Thu 06 Jun 2024 00:09:26 No.100831053 Report

Quoted By:

>>100830883
so, how much in GBs ??

Anonymous

Anonymous Thu 06 Jun 2024 00:09:44 No.100831056 Report

Quoted By: >>100831107

>>100831026
If it really is that simple, that would make a lot of models really interesting for me that weren't so far. I'm gonna wait tho. I got burned too often. People in this sphere really have a tendency to make grandiose claims that later turned out to be flukes.

Anonymous

Anonymous Thu 06 Jun 2024 00:09:50 No.100831059 Report

Quoted By:

>>100831009
>>100831026
Dayum intel. Nice if true.
Wonder how the method can be expanded to apply to even lower bit sizes.
>https://huggingface.co/spaces/Intel/low_bit_open_llm_leaderboard
Never seen that.

Anonymous

Anonymous Thu 06 Jun 2024 00:10:40 No.100831068 Report

Quoted By:

>>100831044
>implying it was usable with mikufag spammer

Anonymous

Anonymous Thu 06 Jun 2024 00:11:29 No.100831079 Report

Quoted By:

>>100831026
makes you mostly wonder how much obvious useless crap the models seem to have.

Anonymous

Anonymous Thu 06 Jun 2024 00:11:48 No.100831082 Report

Quoted By:

>>100831016
No.

Anonymous

Anonymous Thu 06 Jun 2024 00:12:39 No.100831093 Report

Quoted By:

>>100831016
Yes.

Anonymous

Anonymous Thu 06 Jun 2024 00:13:26 No.100831104 Report

Quoted By:

>>100831034
>Large Language Models (LLMs) have demonstrated exceptional proficiency in language-related tasks, but their deployment poses significant challenges due to substantial memory and storage requirements. Weight-only quantization has emerged as a promising solution to address these challenges. Previous research suggests that fine-tuning through up and down rounding can enhance performance. In this study, we introduce SignRound, a method that utilizes signed gradient descent (SignSGD) to optimize rounding values and weight clipping within just 200 steps. SignRound integrates the advantages of Quantization-Aware Training (QAT) and Post-Training Quantization (PTQ), achieving exceptional results across 2 to 4 bits while maintaining low tuning costs and avoiding additional inference overhead. For example, SignRound achieves absolute average accuracy improvements ranging from 6.91\% to 33.22\% at 2 bits. It also demonstrates robust generalization to recent models and achieves near-lossless quantization in most scenarios at 4 bits.
2bit lossless when

Anonymous

Anonymous Thu 06 Jun 2024 00:13:26 No.100831106 Report

Quoted By:

It's not supported in Llama.cpp is it?

Anonymous

Anonymous Thu 06 Jun 2024 00:13:38 No.100831107 Report

Quoted By: >>100831217

>>100831056
At least this time it's published by an actually renowned company and by random chinks or some shady literally who AI startup that existed for two weeks.

Anonymous

Anonymous Thu 06 Jun 2024 00:20:05 No.100831181 Report

Quoted By: >>100831266

>>100828064
>c2
>more jailbreak than actual RP
Now that shows just how much of a skillet proxyfags are. A simple two word pre-fill is enough to remove all censorship from Claude 3.

Anonymous

Anonymous Thu 06 Jun 2024 00:22:36 No.100831217 Report

Quoted By:

>>100831107
I only see chink names in the paper but I havent slept in 24 hours so I might be misreading

Anonymous

Anonymous Thu 06 Jun 2024 00:27:12 No.100831266 Report

Quoted By:

>>100831181
Nah, it's simply not true.

Anonymous

Anonymous Thu 06 Jun 2024 00:28:45 No.100831281 Report

Quoted By: >>100831386

>>100831034
Shame they didn't test on L3. But it's probably not too hard to adapt the training code for it. The appendix says they ran 70B in 3 hours on 80GB VRAM, so 8B might be doable with only 24GB

Anonymous

Anonymous Thu 06 Jun 2024 00:32:46 No.100831319 Report

Quoted By: >>100831421

Interesting how this is published right when it came out that their new SoCs have considerable space dedicated for NPUs

Anonymous

Anonymous Thu 06 Jun 2024 00:39:08 No.100831386 Report

Quoted By: >>100831567

>>100831281
>8B 4bit.
Are you gonna load it on your psp?

Anonymous

Anonymous Thu 06 Jun 2024 00:40:58 No.100831403 Report

Quoted By:

They even did Mixtral and according to their numbers there was no performance loss. This would be amazing if really true. I always felt 4 bit models were retarded and didn't bother with them.

Anonymous

Anonymous Thu 06 Jun 2024 00:42:55 No.100831421 Report

Quoted By:

>>100831319
Nvidia just surpassed Apple, Intel is trying to do something about it.

Anonymous

Anonymous Thu 06 Jun 2024 00:44:14 No.100831437 Report

Quoted By: >>100831715

New Thread
>>100831430
>>100831430
>>100831430

Anonymous

Anonymous Thu 06 Jun 2024 00:48:29 No.100831498 Report

Quoted By:

>mikutard got swept away by jannies
So. How does it feel faggot? Did you like it?

Anonymous

Anonymous Thu 06 Jun 2024 00:52:17 No.100831543 Report

Quoted By:

whats with scientists and using ancient models like facebook opt

Anonymous

Anonymous Thu 06 Jun 2024 00:54:40 No.100831567 Report

Quoted By:

>>100831386
The point is to make sure it works before putting any serious effort into it

Anonymous

Anonymous Thu 06 Jun 2024 01:02:35 No.100831659 Report

Quoted By: >>100831688

So these new 9B models aren't just a copy paste of llama, meaning that Llama.cpp and the like need to implement new code to run it, correct?

Anonymous

Anonymous Thu 06 Jun 2024 01:04:40 No.100831684 Report

Quoted By: >>100831695

Local models status?

Anonymous

Anonymous Thu 06 Jun 2024 01:05:05 No.100831688 Report

Quoted By:

>>100831659
*spits directly into your mouth as you're speaking*

Anonymous

Anonymous Thu 06 Jun 2024 01:05:48 No.100831695 Report

Quoted By:

>>100831684
dead

Anonymous

Anonymous Thu 06 Jun 2024 01:07:53 No.100831715 Report

Quoted By:

>>100831437
Its over

Anonymous

Anonymous Thu 06 Jun 2024 01:11:40 No.100831748 Report

Quoted By: >>100831965

New Thread
>>100831740
>>100831740
>>100831740

Anonymous

Anonymous Thu 06 Jun 2024 01:21:55 No.100831855 Report

Quoted By:

this OP spam is pathetic

Anonymous

Anonymous Thu 06 Jun 2024 01:31:13 No.100831965 Report

Quoted By:

>>100831748
stop spamming

Anonymous

Anonymous Thu 06 Jun 2024 02:56:31 No.100832807 Report

Quoted By:

It's getting dark.
I think... I think it's time to let it all go. I see a blue glow. Or is it red? It's both. I don't want to follow them, now do I want to stay here alone. I will stay. It's not worth the struggle.
Come, nothing, come.

Anonymous

Anonymous Thu 06 Jun 2024 03:48:25 No.100833351 Report

Quoted By:

>>100830883
Nothingburger.

Anonymous

Anonymous Thu 06 Jun 2024 04:20:11 No.100833601 Report

Quoted By:

I don't know if it's my prompt set up, but Stheno 8B is actually decently forward, in that it doesn't just play chicken to the heat death of the universe if I don't push the plot forward myself.
Not bad, for an 8B.

Subject

Name

E-mail

Password

Capcode	All Only User Posts Only Verified Posts Only Moderator Posts Only Manager Posts Only Admin Posts Only Developer Posts Only Founder Posts
Show Posts	All Only With Images Only Without Images Only Spoiler Images Only Non-Spoiler Images
Deleted Posts	All Only Deleted Posts Only Non-Deleted Posts
Ghost Posts	All Only Ghost Posts Only Non-Ghost Posts
Post Type	All Only Sticky Threads Only Opening Posts Only Reply Posts
Results	All Grouped By Threads
Order	Latest Posts First Oldest Posts First

On these archives

On these boards

Your latest searches

/lmg/ - Local Models General