INDEX
Explanations
prepositions indicating distance or direction
New Auto-Interp
Negative Logits
©¶æ¥µ
-0.88
Lic
-0.72
laughter
-0.71
few
-0.71
wang
-0.69
IER
-0.67
vic
-0.67
fif
-0.67
had
-0.66
soon
-0.66
POSITIVE LOGITS
afar
0.79
home
0.77
civilisation
0.74
shore
0.73
whence
0.72
ables
0.70
civilization
0.70
them
0.70
caring
0.69
anything
0.69
Activations Density 0.068%