INDEX
Explanations
words related to intention or purpose
phrases indicating purpose or intent
New Auto-Interp
Negative Logits
uesday
-0.70
natureconservancy
-0.60
reports
-0.54
Transcript
-0.53
Flavoring
-0.52
Mahjong
-0.52
urus
-0.51
ua
-0.51
Zen
-0.50
Reports
-0.50
POSITIVE LOGITS
to
0.95
solely
0.90
primarily
0.87
purely
0.84
principally
0.82
for
0.80
specifically
0.74
to
0.73
chiefly
0.71
exclusively
0.71
Activations Density 0.067%