INDEX
Explanations
proper nouns, specifically names belonging to individuals with the first name Jim
occurrences of the name "Jim."
New Auto-Interp
Negative Logits
selective
-0.74
behold
-0.61
Constructed
-0.59
detection
-0.59
princ
-0.58
BOOK
-0.58
pac
-0.58
screened
-0.57
heck
-0.57
subsequ
-0.57
POSITIVE LOGITS
mie
1.54
enez
1.50
iny
1.05
bo
1.05
mys
1.02
inez
0.98
Butcher
0.96
mi
0.94
Yong
0.85
bos
0.84
Activations Density 0.017%