INDEX
Explanations
proper nouns, particularly names and specific places
New Auto-Interp
Negative Logits
antha
-0.15
acro
-0.15
owski
-0.15
sos
-0.14
weed
-0.14
pars
-0.14
mainwindow
-0.14
-fi
-0.14
رÙĪØ´
-0.14
eskort
-0.13
POSITIVE LOGITS
à¥ĩय
0.16
weather
0.15
trap
0.14
Collapse
0.14
385
0.13
emin
0.13
eteria
0.13
fragile
0.13
ustain
0.13
ازÙħ
0.13
Activations Density 0.108%