INDEX
Explanations
references to the Oxford University or its associated entities
New Auto-Interp
Negative Logits
lli
-0.71
moder
-0.70
Kran
-0.65
NIM
-0.62
🔥🔥
-0.60
ê
-0.59
ilet
-0.58
Ile
-0.58
بيها
-0.58
י
-0.57
POSITIVE LOGITS
Oxford
1.57
Oxford
1.51
OXFORD
1.40
oxford
1.38
oxford
1.09
Oxfordshire
0.96
trouw
0.93
'\\;'
0.89
OX
0.84
nakalista
0.81
Activations Density 0.002%