INDEX
Explanations
phrases indicating certainty or assurance
affirmative phrases or confirmations
New Auto-Interp
Negative Logits
âĵĺ
-0.75
RAW
-0.68
LESS
-0.67
utenberg
-0.67
inational
-0.67
UCHIJ
-0.65
inventoryQuantity
-0.64
natureconservancy
-0.64
mercial
-0.63
âĸº
-0.60
POSITIVE LOGITS
terday
0.77
ndra
0.73
hea
0.72
Virtue
0.71
ties
0.70
xus
0.70
ardon
0.70
sure
0.69
antha
0.68
enough
0.67
Activations Density 0.024%