INDEX
Explanations
proper nouns and names
references to prominent figures in images or captions related to events and contexts
Negative Logits
rall      -0.79
desk      -0.63
comprom   -0.63
forwards  -0.63
erect     -0.62
quar      -0.62
pageant   -0.62
stru      -0.62
fences    -0.62
wedd      -0.60
Positive Logits
2         1.21
2         1.11
Secondly  0.99
²         0.99
Secondly  0.92
Second    0.86
Second    0.84
II        0.83
0002      0.82
second    0.82
Activations Density: 0.125%