INDEX
Explanations
phrases related to questioning or skepticism
phrases expressing levels of importance or significance
New Auto-Interp
Negative Logits
atl
-0.67
Vide
-0.67
LIB
-0.66
ROR
-0.65
KM
-0.63
Gutenberg
-0.60
alky
-0.60
packages
-0.60
Prosper
-0.59
ette
-0.59
POSITIVE LOGITS
surely
1.16
logically
1.09
understandably
1.06
naturally
1.05
beh
0.96
undoubtedly
0.93
likely
0.92
inevitably
0.91
unavoid
0.91
shouldn
0.90
Activations Density 0.425%