INDEX
Explanations
references to locations and medical environments
New Auto-Interp
Negative Logits
ɚ
-0.59
esternos
-0.58
idéia
-0.58
万美元
-0.57
ameryka
-0.56
inigte
-0.55
Američ
-0.55
Amerikaanse
-0.55
amerikanischen
-0.54
iastes
-0.54
POSITIVE LOGITS
UK
1.39
British
1.34
England
1.25
英国
1.25
Britain
1.24
London
1.22
British
1.18
UK
1.17
£
1.11
イギリス
1.10
Activations Density 2.189%