INDEX
Explanations
academic achievements and educational progress
New Auto-Interp
Negative Logits
rou
-0.16
isted
-0.14
kker
-0.14
acas
-0.14
Friendship
-0.14
rouch
-0.13
ÑĤÑĥ
-0.13
assignable
-0.13
Ther
-0.13
iggins
-0.13
POSITIVE LOGITS
college
0.20
college
0.18
colleges
0.17
дал
0.16
Ivy
0.16
ucher
0.16
continuation
0.16
rada
0.15
Engineer
0.15
further
0.15
Activations Density 0.055%