INDEX
Explanations
proper nouns related to a specific individual named "Davis."
mentions of the name "Davis."
New Auto-Interp
Negative Logits
ãĥĥãĥĪ
-0.76
emort
-0.68
isites
-0.68
liest
-0.67
ilater
-0.66
ugal
-0.66
Seym
-0.65
bably
-0.65
stranger
-0.65
folio
-0.64
POSITIVE LOGITS
Davis
0.92
Hanson
0.85
essa
0.80
Davis
0.78
eland
0.75
acre
0.74
ville
0.73
Webb
0.72
ragon
0.72
âĸijâĸij
0.69
Activations Density 0.017%