INDEX
Explanations
specific named entities or proper nouns related to technology products
occurrences of the word "the" and its variations in various contexts
Negative Logits
racial          -0.75
AIDS            -0.74
Soros           -0.73
UNCLASSIFIED    -0.71
ÃŃ              -0.70
claw            -0.70
pursuant        -0.70
911             -0.69
.''             -0.68
leground        -0.68
Positive Logits
cheapest        1.22
simplest        1.16
latest          1.16
aforementioned  1.16
coolest         1.16
easiest         1.15
biggest         1.14
smallest        1.12
latter          1.11
lowest          1.08
Activations Density: 0.640%