INDEX
Explanations
mentions of specific locations, especially related to events or news
New Auto-Interp
Negative Logits
arist
-0.80
naire
-0.78
abet
-0.77
arse
-0.75
DonaldTrump
-0.73
OHN
-0.73
toggle
-0.73
ischer
-0.72
ayne
-0.70
osit
-0.69
POSITIVE LOGITS
Harbour
1.19
Morning
0.98
Lumpur
0.98
Opera
0.94
Harbor
0.92
Wand
0.82
harbour
0.78
FC
0.72
suburbs
0.70
Airport
0.70
Activations Density 0.016%