INDEX
Explanations
phrases related to doing something for a specific purpose or duration
phrases that express actions or sentiments done for others
New Auto-Interp
Negative Logits
operated
-0.69
iasco
-0.66
gars
-0.65
glers
-0.65
atars
-0.65
abis
-0.64
atories
-0.64
ering
-0.64
quartered
-0.62
arts
-0.62
POSITIVE LOGITS
awhile
1.18
breakfast
1.04
got
1.02
lunch
1.00
example
0.99
fun
0.99
reasons
0.94
instance
0.94
dinner
0.93
supper
0.91
Activations Density 0.128%