INDEX
Explanations
single-letter detections located within a word
occurrences of an empty or blank text segment
Negative Logits
Angus         -0.69
Emerson       -0.69
appointments  -0.67
Allied        -0.66
Mellon        -0.66
Eag           -0.65
Borders       -0.63
Jagu          -0.63
Osh           -0.62
Clarkson      -0.61
Positive Logits
cess          0.87
sexual        0.85
ria           0.85
lex           0.83
vec           0.83
ird           0.81
lder          0.81
guest         0.80
][            0.80
ctors         0.75
Activations Density: 0.053%