INDEX
Explanations
references to locations and positions, particularly things that are underneath or below other objects
New Auto-Interp
Negative Logits
Ïģθ
-0.16
ill
-0.15
anche
-0.15
izr
-0.15
point
-0.14
onis
-0.14
ữ
-0.14
综åIJĪ
-0.13
ryo
-0.13
asil
-0.13
POSITIVE LOGITS
neath
0.24
cover
0.15
cover
0.15
lord
0.14
838
0.14
lords
0.14
λοι
0.14
istrovstvÃŃ
0.14
768
0.14
attack
0.14
Activations Density 0.032%