INDEX
Explanations
phrases mentioning academic journals and institutions
references to academic institutions or journals
New Auto-Interp
Negative Logits
agascar
-0.68
piles
-0.67
crate
-0.66
bnb
-0.65
brush
-0.65
wolves
-0.64
channelAvailability
-0.62
forth
-0.62
residue
-0.61
haze
-0.61
POSITIVE LOGITS
Sciences
1.27
Arts
1.16
Science
1.01
Pediatrics
0.91
Letters
0.83
Veterinary
0.82
Engineering
0.79
Scientists
0.77
Literature
0.75
arts
0.75
Activations Density 0.050%