Explanations
references to specific universities, particularly those in California
Negative Logits
Token       Logit
ize         -0.16
Wolverine   -0.14
ane         -0.14
инок        -0.14
Esk         -0.13
ixe         -0.13
oi          -0.13
aska        -0.13
classic     -0.13
.uf         -0.13
Positive Logits
Token       Logit
Hayward     0.26
Doming      0.23
Stan        0.22
Long        0.21
Channel     0.20
Full        0.20
North       0.19
Pom         0.18
-dom        0.18
(Long       0.18
Activations Density: 0.003%