INDEX
Explanations
technical terms related to scientific figures or diagrams
references to figures or illustrations in a text
New Auto-Interp
Negative Logits
die
-0.71
ãĥ£
-0.68
lockout
-0.68
̶
-0.65
campus
-0.64
qua
-0.64
zone
-0.62
thro
-0.62
wcs
-0.61
hate
-0.61
POSITIVE LOGITS
1
0.93
below
0.88
Figure
0.86
Contents
0.85
2
0.83
¶
0.82
4
0.81
summarizes
0.80
Figure
0.80
3
0.80
Activations Density 0.036%