INDEX
Explanations
phrases and terms related to personal growth and educational experiences
New Auto-Interp
Negative Logits
ataka
-0.16
ottes
-0.15
drive
-0.15
etect
-0.15
_accessible
-0.14
пеÑĢеÑģ
-0.14
yx
-0.14
gent
-0.14
ocup
-0.14
ortex
-0.14
POSITIVE LOGITS
abroad
0.39
overseas
0.31
Foreign
0.30
foreign
0.30
Foreign
0.25
foreign
0.24
FOREIGN
0.24
Overse
0.23
foreigners
0.22
language
0.19
Activations Density 0.080%