INDEX
Explanations
phrases indicating spatial relationships or locations relative to objects
New Auto-Interp
Negative Logits
ſelf
-0.66
hdashline
-0.65
orianCalendar
-0.65
Efq
-0.62
Cæsar
-0.61
usermodel
-0.61
ſelves
-0.60
felves
-0.60
neſs
-0.56
houſe
-0.55
POSITIVE LOGITS
فريبيس
0.77
behind
0.74
neath
0.70
beneath
0.70
Behind
0.66
derrière
0.66
Behind
0.64
BEHIND
0.64
debajo
0.63
underneath
0.62
Activations Density 0.302%