INDEX
Explanations
phrases related to the presence or absence of specific qualities or items
New Auto-Interp
Negative Logits
Julien
-0.69
rehearing
-0.69
jetunion
-0.66
Peptide
-0.65
steeper
-0.65
nadie
-0.65
jazdy
-0.64
Мексичка
-0.63
flats
-0.63
Kart
-0.63
POSITIVE LOGITS
presence
2.45
presence
2.21
Presence
2.16
Presence
2.08
présence
1.99
presencia
1.74
presença
1.63
presenza
1.58
присут
1.53
PRES
1.46
Activations Density 0.148%