INDEX
Explanations
highly positive statements or exclamations
strong positive affirmations and expressions of excitement
Negative Logits
pec          -0.77
otom         -0.74
onde         -0.73
abled        -0.73
contracted   -0.72
odox         -0.70
ascus        -0.70
istant       -0.69
pta          -0.68
BUS          -0.67
Positive Logits
Especially     1.18
Beaut          0.94
Particularly   0.91
Lots           0.90
GIF            0.90
Literally      0.83
Including      0.82
Kills          0.81
Especially     0.80
Awesome        0.79
Activations Density: 0.395%