INDEX
Explanations
locations such as cities or countries
occurrences of the word "in" along with associated location phrases
New Auto-Interp
Negative Logits
certs
-0.87
Chuck
-0.85
wolves
-0.78
NFL
-0.77
UGH
-0.76
keyes
-0.73
Kids
-0.72
Feed
-0.70
Skip
-0.68
STEM
-0.68
POSITIVE LOGITS
Istanbul
1.80
Kuala
1.69
Ankara
1.66
Budapest
1.65
Tehran
1.63
Vienna
1.59
Cairo
1.57
Jakarta
1.57
Madrid
1.57
Bangkok
1.56
Activations Density 0.268%