INDEX
Explanations
positive expressions or actions
phrases related to significant changes or developments in various contexts
New Auto-Interp
Negative Logits
iru
-0.61
laun
-0.59
inki
-0.55
Accessory
-0.54
vice
-0.53
cig
-0.53
MJ
-0.51
vg
-0.50
Ambro
-0.49
article
-0.48
POSITIVE LOGITS
lately
1.10
since
0.91
recently
0.71
since
0.70
countless
0.67
numerous
0.65
previously
0.60
successfully
0.59
successfully
0.58
extensively
0.58
Activations Density 1.152%