INDEX
Explanations
references to different regions and their economic status in relation to the UK
Negative Logits
ibur      -0.19
ibri      -0.18
bourg     -0.18
Kendrick  -0.15
acman     -0.14
ÅĦ        -0.14
sw        -0.14
uhan      -0.14
ches      -0.14
ãģ¾ãģ§    -0.14
Positive Logits
alg       0.17
dete      0.17
van       0.16
vanished  0.15
Ã¶ÃŁe     0.15
dice      0.15
adol      0.14
egment    0.14
çĭ        0.14
ominator  0.14
Activations Density: 0.256%