INDEX
Explanations
proper names or names of people, potentially focusing on a specific name "Fran"
mentions of specific names, particularly "Fran" and "Cait"
Negative Logits
eenth        -0.80
wrapper      -0.70
gow          -0.68
wagen        -0.68
Graveyard    -0.67
orld         -0.66
ylum         -0.66
andr         -0.66
ivid         -0.65
ambers       -0.65
Positive Logits
kel          1.07
Dres         1.04
Fran         0.97
zen          0.93
furt         0.89
thal         0.85
ck           0.83
cium         0.81
tha          0.79
rier         0.78
Activations Density: 0.008%