INDEX
Explanations
instances of verbs related to evaluation and observation
New Auto-Interp
Negative Logits
imu
-0.06
anken
-0.06
edd
-0.06
örper
-0.06
IT
-0.06
ITT
-0.06
id
-0.06
ep
-0.06
afa
-0.06
ernen
-0.05
POSITIVE LOGITS
them
0.19
å®ĥ
0.18
it
0.17
å®ĥ们
0.17
thereof
0.17
its
0.16
оно
0.16
nó
0.16
them
0.14
ниÑħ
0.13
Activations Density 0.003%