INDEX
Explanations
phrases related to progress or completion
phrases related to improvement or development over time
New Auto-Interp
Negative Logits
ccording
-0.86
ĸļ
-0.79
deen
-0.79
ntil
-0.77
ailability
-0.77
Palestin
-0.73
everal
-0.72
oun
-0.71
Parables
-0.69
Aires
-0.68
POSITIVE LOGITS
Ez
0.57
q
0.53
...
0.53
gmaxwell
0.52
crew
0.51
pawn
0.51
inx
0.51
cy
0.49
rel
0.49
dish
0.48
Activations Density 0.123%