Explanations
names and terms related to individuals or places
mentions of the name "Fran" and its variations
Negative Logits
accompanied   -0.82
gow           -0.78
eenth         -0.77
tomat         -0.75
wrapper       -0.75
scape         -0.74
een           -0.68
Tokens        -0.66
footed        -0.65
stakes        -0.65
Positive Logits
Dres          1.12
Fran          1.09
kel           0.94
furt          0.93
zen           0.91
Fran          0.90
Vog           0.84
opol          0.82
ois           0.80
ç             0.80
Activations Density 0.010%