INDEX
Explanations
North, South, West followed by regions/countries
NEGATIVE LOGITS
Token         Value
Lieblings     0.54
queer         0.51
Larson        0.50
Dul           0.50
Star          0.49
hippocampal   0.49
Parker        0.49
మంచి          0.48
Star          0.47
Ultr          0.47
POSITIVE LOGITS
Token         Value
Europe        1.17
Australia     1.09
Italy         1.08
Asia          1.07
Germany       1.05
Africa        1.04
India         1.02
countries     1.00
Spain         0.99
Europe        0.99
Activations Density 0.107%
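
The density figure above is not defined on this page. A minimal sketch, assuming it means the fraction of token positions at which the feature's activation is above zero: the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions where the feature fires.

    Assumption: "fires" means activation > threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical data: 100,000 token positions, 107 of them with a
# nonzero activation, chosen only so the result resembles the figure above.
sample = [0.0] * 100_000
for i in range(107):
    sample[i * 900] = 1.5

density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: under the assumed definition it fires on roughly one token in a thousand.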