INDEX
Explanations
references to countries
country followed by specific nouns
Negative Logits
ontas           -0.53
Diweddarwch     -0.52
éal             -0.50
OSS             -0.48
őtt             -0.47
️               -0.47
bootstrapcdn    -0.47
kosh            -0.47
römischen       -0.47
elance          -0.46
Positive Logits
wide            1.22
side            0.98
WIDE            0.95
Wide            0.81
wide            0.74
club            0.68
club            0.66
bump            0.64
SIDE            0.64
side            0.62
Activations Density: 0.046%