INDEX
Explanations
references to specific college programs and their associated individuals or titles
New Auto-Interp
Negative Logits
jedn
-0.08
sayıda
-0.07
ãģ°ãģĭãĤĬ
-0.07
dae
-0.07
xae
-0.07
ceb
-0.07
billig
-0.07
pron
-0.07
ẫ
-0.07
èĮ
-0.07
POSITIVE LOGITS
201
0.15
199
0.12
200
0.11
198
0.11
197
0.10
202
0.10
196
0.09
0.09
195
0.07
March
0.07
Activations Density 0.038%