INDEX
Explanations
proper nouns or names, specifically focused on the name "Troy"
mentions of the word "Troy."
Negative Logits
INESS      -0.81
awar       -0.75
pmwiki     -0.73
iquid      -0.72
ãĥĥãĤ¯     -0.70
ubric      -0.69
ivia       -0.68
pity       -0.68
merga      -0.67
ivities    -0.67
Positive Logits
alty       0.91
alties     0.90
ij士       0.81
neau       0.79
al         0.77
imet       0.75
don        0.75
tics       0.74
er         0.73
nce        0.72
Activations Density 0.032%
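
As a rough illustration of how a density figure like this can be read, the sketch below assumes that activation density means the fraction of sampled tokens on which the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; they are not taken from the dashboard itself.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: "density" is the share of sampled tokens on which the
    feature fires at all. The dashboard may define it differently.
    """
    if not activations:
        return 0.0
    fired = sum(1 for a in activations if a > threshold)
    return fired / len(activations)


# Hypothetical sample: 100,000 tokens, 32 of which activate the feature.
sample = [0.0] * 99_968 + [1.5] * 32
density = activation_density(sample)
print(f"{density:.3%}")
```

Under that reading, a density of 0.032% would mean the feature fires on roughly 32 of every 100,000 tokens, which fits a feature tied to one specific name.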