INDEX
Explanations
references to the University of Oxford and its related institutions
New Auto-Interp
Negative Logits
ochen
-0.16
atel
-0.15
aji
-0.15
afc
-0.14
ONO
-0.14
ÎŃÏģγ
-0.14
Vinci
-0.14
μÏĨ
-0.13
ITERAL
-0.13
ç¾½
-0.13
POSITIVE LOGITS
ian
0.21
shire
0.18
onian
0.17
inden
0.15
avier
0.15
ram
0.15
kê
0.14
ÑĤин
0.14
IAN
0.14
ุม
0.14
Activations Density 0.025%