INDEX
Explanations
mention of figures or references within a text
references to figures or illustrations in the text
Negative Logits
ophobic         -0.65
banned          -0.64
ocracy          -0.63
dictionary      -0.62
wars            -0.61
admin           -0.60
broadcast       -0.60
supers          -0.59
multicultural   -0.59
classics        -0.58
Positive Logits
Fig             3.97
Fig             3.08
FIG             2.11
fig             2.03
Figure          2.01
fig             1.91
FIG             1.72
Figure          1.70
Figures         1.68
Table           1.35
Activations Density: 0.028%