INDEX
Explanations
references to students in different academic years
Negative Logits
sophomore    -0.20
Graduate     -0.19
_gradients   -0.19
youthful     -0.18
Senior       -0.17
-gradient    -0.16
senior       -0.16
_gradient    -0.16
Senior       -0.16
freshman     -0.16
Positive Logits
-year        0.21
year         0.18
itis         0.17
-level       0.17
اÙĩ          0.17
omore        0.17
year         0.16
-aged        0.16
Bridges      0.15
unde         0.15
Activations Density 0.021%