INDEX
Explanations
words related to geographical locations, specifically focusing on Cambridge
mentions of the city Cambridge
New Auto-Interp
Negative Logits
ued
-0.74
ore
-0.67
idols
-0.67
lava
-0.65
Romo
-0.65
upd
-0.64
lol
-0.64
idol
-0.64
Nightmare
-0.63
jer
-0.62
POSITIVE LOGITS
Cambridge
3.74
Oxford
2.07
Harvard
1.73
Princeton
1.66
Worcester
1.62
Cam
1.59
Copenhagen
1.57
Waterloo
1.48
Plymouth
1.47
Sussex
1.45
Activations Density 0.015%