INDEX
Explanations
mentions of the term "Hopkins."
mentions of "Hopkins" and "Johns Hopkins University."
New Auto-Interp
Negative Logits
gered
-0.84
cing
-0.78
regate
-0.70
gers
-0.68
ffen
-0.66
pt
-0.66
$$$$
-0.66
razil
-0.66
ãĥł
-0.65
opsis
-0.62
POSITIVE LOGITS
Hopkins
1.18
hip
1.01
kins
0.99
kinson
0.99
bay
0.83
ley
0.81
erenn
0.80
stone
0.80
endish
0.79
Hutch
0.76
Activations Density 0.024%