INDEX
Explanations
phrases related to intentions or plans
occurrences of the word "intention."
Negative Logits
cit          -0.94
java         -0.82
hetti        -0.69
Wolves       -0.67
Tycoon       -0.66
apple        -0.65
Figures      -0.65
angs         -0.65
gars         -0.64
Solitaire    -0.63
Positive Logits
ality        1.05
ful          0.83
fulness      0.82
lessly       0.80
intent       0.80
ual          0.75
intentions   0.75
reprene      0.73
ipal         0.73
ually        0.73
Activations Density: 0.014%