INDEX
Explanations
terms related to comparison or contrast
repetitive phrases that denote existence or presence
Negative Logits
ocracy            -0.75
unfocusedRange    -0.72
iew               -0.67
ographer          -0.66
ocaust            -0.65
gow               -0.64
inav              -0.63
afety             -0.63
ioch              -0.62
teness            -0.62
Positive Logits
respectively      1.43
trademarks        1.22
mutually          1.12
alike             1.11
examples          1.08
jointly           1.03
staples           1.00
both              1.00
fronts            0.99
pillars           0.98
Activations Density: 0.233%