Agreed. This conversation has me convinced not only that this isn't an ideal solution, but that ideal solutions are probably already being implemented as improved FP8 ops in new hardware. FP8s don't need to be accurate. But there are better ways to do this.
E.g. 1.875*1.875 seems to be among the worst cases for the freshly proposed algorithm in fp8, yielding >18% error (fp16 can have errors around 24%). The much older ApproxLP algorithm discussed in that earlier paper, which similarly removes multiplication, is <0.5% off for the same inputs.
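That worst case is easy to reproduce with a quick sketch. This assumes the proposal's "add the mantissas" rule, i.e. replacing the mantissa product with a constant offset 2^-l (l = 3 here, matching e4m3's 3-bit mantissa); that's my reading of the idea, not necessarily the exact hardware rule:

```python
import math

def lmul_approx(x: float, y: float, l: int = 3) -> float:
    """Approximate x*y by adding exponents and fractional mantissas,
    with a constant offset 2**-l standing in for the mantissa product."""
    fx, ex = math.frexp(x)          # x = fx * 2**ex, fx in [0.5, 1)
    fy, ey = math.frexp(y)
    mx = fx * 2 - 1                 # fractional mantissa in [0, 1)
    my = fy * 2 - 1
    mant = 1 + mx + my + 2 ** -l    # offset term replaces mx*my
    exp = (ex - 1) + (ey - 1)
    if mant >= 2:                   # carry spills into the exponent
        mant /= 2
        exp += 1
    return mant * 2 ** exp

exact = 1.875 * 1.875               # 3.515625
approx = lmul_approx(1.875, 1.875)  # 2.875
rel_err = abs(exact - approx) / exact
print(f"approx={approx}, exact={exact}, rel. error={rel_err:.1%}")  # ~18.2%
```

With both mantissas near their maximum (0.875), the dropped mx*my term is at its largest, which is why this input sits near the worst case.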
But according to their results there's minimal difference between this approximation (in fp8?) and regular fp16, which seems surprising. It also makes me wonder whether they chose to publish only the tests where the results were favorable.
I wouldn't trust something like this for anything higher than FP8. But FP8 and lower are great for inference. Just pretty useless for training!
The error for the same values would still be something like 5.5% for FP8_e4m3, I think. And while they mostly talk about FP8, there are all kinds of comparisons to FP16, and the claimed differences seem surprisingly small.
This paper is talking about FP8 and below. That's only useful for inference, not training. You need high precision for training.
The base model is FLUX.1 dev fp8.
First I picture the image in my head and write a sentence. I run it through DeepL. I tidy up the wording a bit. I decide on portrait or landscape, keeping in mind how much the image size pulls the result around. I hit the generate button and in about a minute it comes out like this. For details, see the ALT field or my posts on Tensor Art.
It's local, right? About the model: the FLUX model I'm using is flux1-dev-fp8, and according to Monmon-san, NSFW is restricted and can't be generated. Sure enough, even if I prompt for sheer panties, it stops at lace panties. I'm also looking for a FLUX model that can handle NSFW. No, no, if Yusao-san got serious, I'd definitely lose. (I've somehow turned this into a competition. Sorry.)