INDEX
Explanations
phrases indicating a specific intention or purpose
references to intent in various contexts
New Auto-Interp
Negative Logits
Tycoon
-0.72
Chocobo
-0.68
Cinderella
-0.67
Ibid
-0.64
Occ
-0.63
cub
-0.62
stood
-0.62
Cats
-0.62
Jenner
-0.62
Serge
-0.61
POSITIVE LOGITS
intent
1.29
intent
0.88
intention
0.85
intending
0.84
lessly
0.83
illery
0.82
guiActiveUn
0.79
fulness
0.79
edly
0.75
Intent
0.75
Activations Density 0.005%