INDEX
Explanations
references to academic or university administrative topics and discussions
New Auto-Interp
Negative Logits
superf
-0.15
ãĥ¼ãĥĢ
-0.15
ova
-0.15
chner
-0.15
ahn
-0.14
ersiz
-0.14
еним
-0.14
zin
-0.14
ë°Ľ
-0.14
Lem
-0.14
POSITIVE LOGITS
vice
0.30
Vice
0.29
VC
0.26
vars
0.25
vice
0.23
synd
0.21
hostel
0.21
Dean
0.20
senate
0.20
Synd
0.20
Activations Density 0.033%