INDEX
Explanations
terms related to government, politics, espionage, and military activities
proper nouns and specific group identifiers related to entities and demographics
Negative Logits
eanor         -0.75
ibaba         -0.73
uyomi         -0.69
vable         -0.67
itars         -0.67
chwitz        -0.65
accessible    -0.63
positives     -0.62
comings       -0.62
ashtra        -0.62
Positive Logits
º              0.60
¦              0.59
contractor     0.59
Writers        0.59
Logged         0.57
guiActiveUn    0.56
intermediary   0.56
commentator    0.56
ADA            0.56
eman           0.56
Activations Density: 0.386%