INDEX
Explanations
words related to opposition or resistance
words related to assumptions or beliefs
Negative Logits
SEA          -0.85
Masters      -0.83
AMS          -0.83
GER          -0.80
Timber       -0.74
DER          -0.73
Abyssal      -0.72
Forest       -0.72
Schneider    -0.72
Rite         -0.71
Positive Logits
supp         1.30
scrut        1.05
lication     0.99
osition      0.95
ressive      0.94
pse          0.92
orter        0.90
roleum       0.86
uration      0.85
plaus        0.84
Activations Density 0.008%