INDEX
Explanations
words related to direction or orientation
the word "ward" in various contexts related to directionality
Negative Logits
ocular    -0.91
acea      -0.84
obook     -0.81
gdala     -0.78
anan      -0.71
tein      -0.70
colo      -0.68
azo       -0.68
coni      -0.66
itu       -0.63
Positive Logits
robe      0.99
ward      0.93
ness      0.87
wards     0.82
ly        0.81
nesses    0.79
abouts    0.74
ments     0.72
Bound     0.71
Spiral    0.68
Activations Density: 0.018%