Explanations
currency amounts in British pounds
Negative Logits
Token      Logit
oten       -0.15
اته        -0.14
ereotype   -0.14
584        -0.14
ullen      -0.13
lovak      -0.13
ırak       -0.13
ubbo       -0.13
adlo       -0.13
226        -0.13
Positive Logits
Token      Logit
anness     0.16
anity      0.16
rain       0.15
ियत        0.15
格         0.14
aving      0.14
kok        0.14
bír        0.14
fw         0.14
Rol        0.14
Activation Density: 0.014%
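
For a sense of scale, a density of 0.014% corresponds to roughly 1.4 active tokens per 10,000. The page does not say how the figure is computed; the sketch below assumes it is the fraction of sampled tokens on which the feature's activation is above zero. The function name `activation_density`, the `threshold` parameter, and the toy data are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the
    feature fires at all; the source does not define it.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical toy data: the feature fires on 14 of 100,000 tokens.
toy_activations = [1.0] * 14 + [0.0] * 99_986
density = activation_density(toy_activations)
print(f"{density:.3%}")
```

With real data, `activations` would be the feature's per-token activation values over whatever sample of text the dashboard was built from.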