INDEX
Explanations
introducing listings or emotional states
Negative Logits
തേ  0.46
المه  0.45
दोग  0.45
ভুট  0.42
ultz  0.40
ال  0.40
旷  0.39
contraception  0.38
Sp  0.38
ോക  0.38
Positive Logits
Warm  0.44
excited  0.43
RAMM  0.42
excit  0.41
agitated  0.41
grim  0.40
Excited  0.40
fortiter  0.39
umerate  0.39
frightened  0.39
Activations Density: 0.001%