INDEX
Explanations
occurrences of the word "University."
New Auto-Interp
Negative Logits
cura
-0.18
inia
-0.17
throp
-0.16
ibri
-0.15
chez
-0.15
voks
-0.14
adders
-0.14
487
-0.14
iband
-0.14
æĺĮ
-0.14
POSITIVE LOGITS
alet
0.16
GN
0.16
ftime
0.15
ODEV
0.14
offense
0.13
specialchars
0.13
ITER
0.13
trab
0.13
FLASH
0.13
ivet
0.13
Activations Density 0.020%