INDEX
Explanations
academic institutions and departments related to science
references to various academic disciplines related to sciences
Negative Logits
adding        -0.73
template      -0.66
own           -0.62
acting        -0.62
leg           -0.62
usher         -0.61
deserted      -0.60
aggressive    -0.60
shr           -0.60
irth          -0.60
Positive Logits
Sciences       1.32
sciences       1.05
terday         0.86
Profession     0.85
istries        0.82
Neuroscience   0.82
Pwr            0.80
Laboratories   0.79
pecially       0.78
udo            0.76
Activations Density: 0.005%