INDEX
Explanations
mentions of the name "Steve"
occurrences of the name "Steve."
New Auto-Interp
Negative Logits
ktop
-0.78
exempt
-0.77
tumblr
-0.75
bound
-0.75
Spoiler
-0.73
appropriately
-0.72
soon
-0.70
spring
-0.70
yout
-0.69
teen
-0.69
POSITIVE LOGITS
Bannon
0.99
Rogers
0.91
Jobs
0.91
Irwin
0.88
Trevor
0.87
McInt
0.86
Allen
0.84
Martin
0.84
Ange
0.84
Steve
0.83
Activations Density 0.009%