Explanations
words related to legal and political contexts, especially related to individuals and their actions
Negative Logits
Token         Logit
Interstitial  -0.70
¥µ            -0.67
TERN          -0.64
âĸ¬âĸ¬        -0.63
animate       -0.63
frey          -0.62
stration      -0.60
ãĥ¬           -0.60
ORGE          -0.59
cemic         -0.59
Positive Logits
Token         Logit
ernel         0.88
ed            0.85
irts          0.82
atchewan      0.79
ozy           0.78
edIn          0.78
mallow        0.76
er            0.76
itudinal      0.75
hoff          0.71
Activations Density 5.366%