INDEX
Explanations
references to specific or particular items or actions
instances of the word "specifically."
New Auto-Interp
Negative Logits
Isles
-0.81
ulton
-0.74
anon
-0.71
lyn
-0.71
Kenn
-0.64
Afee
-0.64
ILY
-0.64
ocene
-0.63
izoph
-0.61
former
-0.61
POSITIVE LOGITS
tailored
1.02
targeted
0.98
exempted
0.88
formulated
0.87
designed
0.86
geared
0.85
suited
0.84
tuned
0.83
targeting
0.83
engineered
0.82
Activations Density 0.022%