INDEX
Explanations
references to proximity or location of objects or people in relation to one another
New Auto-Interp
Negative Logits
houſe
-0.88
Houſe
-0.85
ſelf
-0.82
pleaſure
-0.82
purpoſe
-0.80
Majefty
-0.80
脚注の使い方
-0.79
Diſ
-0.76
ſelves
-0.76
perſon
-0.76
POSITIVE LOGITS
beside
0.97
obok
0.95
adjacent
0.85
accanto
0.81
opposite
0.79
рядом
0.78
Beside
0.78
nearby
0.74
vicino
0.71
Nearby
0.71
Activations Density 0.176%