INDEX
Explanations
No Explanations Found
New Auto-Interp
Negative Logits
stupid
1.28
annoyed
1.14
sowas
1.09
supposed
1.07
Basically
1.07
Probably
1.06
fucking
1.05
Whenever
1.03
Whenever
1.01
يعني
1.00
POSITIVE LOGITS
inéd
1.44
handcrafted
1.42
showcase
1.42
lumin
1.42
showcased
1.41
最新
1.40
curated
1.40
innovative
1.35
作品
1.35
新作
1.35
Activations Density 0.785%