INDEX
Explanations
occurrences of the word "identify" and its variations, highlighting themes of self-identification and categorization
New Auto-Interp
Negative Logits
our
-0.18
ums
-0.16
-Israel
-0.15
Faction
-0.14
owing
-0.14
izia
-0.14
sta
-0.14
/if
-0.14
istry
-0.14
ull
-0.14
POSITIVE LOGITS
entities
0.17
undef
0.17
ENTITY
0.15
agnost
0.15
ropri
0.15
emp
0.15
urs
0.15
twin
0.14
ifiant
0.14
aho
0.14
Activations Density 0.022%