Explanations
Sentences containing various forms of the verb "to be" and prepositions indicating relationships between objects.
Negative Logits
Chapman   -0.18
323       -0.15
Rig       -0.15
ire       -0.15
gt        -0.15
coli      -0.15
ship      -0.14
ských     -0.14
cape      -0.14
_PROTO    -0.14
Positive Logits
acket     0.17
nemonic   0.17
bow       0.16
aat       0.15
umbn      0.15
âĶĺ       0.15
uste      0.15
onaut     0.15
ÃŃna      0.15
udu       0.14
Activations Density: 0.002%