INDEX
Explanations
references to the word "Simone" and variations thereof, indicating a focus on a specific individual or subject associated with the name
New Auto-Interp
Negative Logits
esh
-0.17
SSI
-0.16
enan
-0.15
oref
-0.15
itel
-0.15
awl
-0.15
ymb
-0.15
ymph
-0.14
NDEBUG
-0.14
обов
-0.14
POSITIVE LOGITS
ulations
0.22
ult
0.22
ulating
0.21
ply
0.20
bol
0.20
ulators
0.20
oleon
0.19
posium
0.19
mons
0.18
sim
0.18
Activations Density 0.011%