INDEX
Explanations
prepositions indicating a spatial or conceptual relationship
concepts related to independence and separation
New Auto-Interp
Negative Logits
ãĤª
-0.83
eas
-0.75
Keys
-0.74
oji
-0.72
deal
-0.70
hiba
-0.70
cussion
-0.69
kef
-0.68
idav
-0.68
cho
-0.66
POSITIVE LOGITS
ours
1.11
itself
0.88
yours
0.88
another
0.85
theirs
0.83
other
0.82
hers
0.81
existing
0.81
the
0.80
others
0.78
Activations Density 0.384%