INDEX
Explanations
occurrences of the word "identify" and its variations
New Auto-Interp
Negative Logits
our
-0.19
ums
-0.17
Faction
-0.16
ala
-0.15
istry
-0.15
ook
-0.15
/if
-0.14
-Israel
-0.14
asse
-0.14
háºŃu
-0.14
POSITIVE LOGITS
undef
0.16
entities
0.16
urs
0.15
ropri
0.14
twin
0.14
emp
0.14
ifiant
0.14
rõ
0.14
ENTITY
0.14
key
0.14
Activations Density 0.022%