INDEX
Explanations
words related to people's names, with a focus on the name "Claire"
the end of a document or a text segment
New Auto-Interp
Negative Logits
lar
-0.76
alling
-0.74
ksh
-0.74
balls
-0.72
MODE
-0.70
gra
-0.69
ALS
-0.69
tec
-0.69
BIP
-0.68
CVE
-0.68
POSITIVE LOGITS
ojure
0.97
avier
0.89
oner
0.82
osing
0.79
leans
0.78
othing
0.77
oning
0.76
inant
0.75
xus
0.74
osures
0.74
Activations Density 0.077%