INDEX
Explanations
proper nouns related to a specific person or place named "Collins"
mentions of the name "Collins."
New Auto-Interp
Negative Logits
oused
-0.70
ICAN
-0.70
itia
-0.70
ETA
-0.67
istic
-0.66
axter
-0.65
fare
-0.64
discrep
-0.64
iddled
-0.63
ilitation
-0.63
POSITIVE LOGITS
worth
1.17
creen
0.81
es
0.80
ively
0.78
mount
0.77
uit
0.77
Collins
0.77
engers
0.76
Andersen
0.75
boro
0.75
Activations Density 0.057%