INDEX
Explanations
phrases indicating a lack of progress or realization regarding various concepts or technologies
preceding a negation
not yet achieved
New Auto-Interp
Negative Logits
ſeveral
-0.89
pleaſure
-0.79
ſelf
-0.78
myſelf
-0.77
purpoſe
-0.74
itſelf
-0.73
leſs
-0.72
ſmall
-0.72
miſ
-0.71
ſtate
-0.71
POSITIVE LOGITS
adequately
0.95
vraiment
0.83
véritable
0.82
really
0.81
Yet
0.79
yet
0.78
sufficiently
0.76
any
0.76
seem
0.75
adequate
0.72
Activations Density 0.673%