INDEX
Explanations
references to individuals in various contexts and events
New Auto-Interp
Negative Logits
icha
-0.17
ropp
-0.16
kud
-0.14
æŃ
-0.14
ç»
-0.14
adoo
-0.13
æĪª
-0.13
figur
-0.13
ิà¹Ī
-0.13
ãĥ¼ãĥĢ
-0.12
POSITIVE LOGITS
center
0.57
centre
0.54
left
0.52
right
0.43
center
0.43
far
0.42
centre
0.41
left
0.41
-center
0.38
foreground
0.38
Activations Density 0.099%