INDEX
Explanations
mentions of the city of Berkeley
references to the city of Berkeley
Negative Logits
ugu       -0.86
meaning   -0.71
oire      -0.70
mble      -0.66
ivably    -0.66
demand    -0.65
ollow     -0.64
ACTED     -0.63
BOOK      -0.63
orship    -0.63
Positive Logits
keley       1.22
Berkeley    1.02
shire       0.83
halla       0.79
Unified     0.78
Nig         0.76
phrine      0.75
Heights     0.74
Extension   0.72
Marina      0.70
Activations Density: 0.006%