INDEX
Explanations
references to the UK and British-related terms
New Auto-Interp
Negative Logits
ValueGeneration
-0.85
verru
-0.72
poin
-0.70
Oru
-0.69
sendMail
-0.69
Verr
-0.68
kapa
-0.68
<h3>
-0.68
ress
-0.68
Imo
-0.67
POSITIVE LOGITS
UK
1.14
britannien
1.14
Britain
1.10
Brito
1.08
BRITAIN
1.04
Brit
1.03
Unido
1.00
Brit
0.98
-£
0.98
GBP
0.96
Activations Density 0.071%