INDEX
Explanations
references to Colombia and its cultural elements
Negative Logits
Token     Logit
Flint     -0.18
Hakk      -0.16
bens      -0.15
407       -0.15
aphore    -0.15
zdy       -0.15
ysl       -0.15
878       -0.14
ää        -0.14
ENTA      -0.14
Positive Logits
Token      Logit
Colombia   0.38
colomb     0.35
Colombian  0.35
Colomb     0.34
Columbia   0.32
Bog        0.30
Colum      0.27
olumbia    0.25
Col        0.24
-Col       0.23
Activations Density: 0.024%