INDEX
Explanations
mentions of a specific person named "Collins"
Negative Logits
Token       Logit
ocard       -0.74
ded         -0.68
ICAN        -0.68
oused       -0.67
conduct     -0.65
Bolshe      -0.64
ãĥ¤         -0.62
Narendra    -0.61
cgi         -0.60
visory      -0.60
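The entry ãĥ¤ among the negative logits looks like encoding debris, but it may simply be a token shown in a byte-level BPE vocabulary's printable form. The sketch below assumes a GPT-2 style byte-to-character table, which this page does not confirm; `bytes_to_unicode` and `decode_token` are illustrative helper names, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build the GPT-2 style map from byte values to printable characters.

    Assumption: the model behind this dashboard uses this scheme.
    """
    # Bytes that are already printable keep their own code point.
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    # Every remaining byte is assigned a code point starting at 256.
    for b in range(256):
        if b not in printable:
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return {b: chr(c) for b, c in zip(byte_values, code_points)}


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(displayed):
    """Turn a displayed vocabulary string back into the text it encodes."""
    raw = bytes(CHAR_TO_BYTE[ch] for ch in displayed)
    # Partial multi-byte sequences are replaced rather than raising.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print(repr(decode_token("ãĥ¤")))
```

Under that assumption the three displayed characters correspond to the bytes E3 83 A4, which is the UTF-8 encoding of the katakana character ヤ (U+30E4). If the underlying tokenizer uses a different scheme, this decoding does not apply.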
Positive Logits
Token       Logit
worth       1.31
Collins     1.31
Collins     1.17
ynski       0.86
mount       0.85
Andersen    0.82
Scully      0.82
ulty        0.76
enhagen     0.75
ville       0.75
Activations Density 0.005%