INDEX
Explanations
intense expressions of frustration or strong emotions
New Auto-Interp
Negative Logits
uclear
-0.71
��
-0.68
و
-0.66
uid
-0.65
ablo
-0.63
cano
-0.62
Canary
-0.62
intention
-0.59
infl
-0.59
vas
-0.58
POSITIVE LOGITS
acebook
0.75
Lex
0.70
Kats
0.69
east
0.68
KK
0.64
lass
0.63
etsu
0.61
Maple
0.61
Drop
0.61
APD
0.61
Activations Density 0.048%