INDEX
Explanations
emphasis on comparative phrases indicating increase or enhancement
New Auto-Interp
Negative Logits
***!
-0.73
-0.71
asgi
-0.70
pylab
-0.68
AnchorStyles
-0.67
myſelf
-0.65
ValueGenerated
-0.63
Вікіпе
-0.63
للمعارف
-0.63
ویکیپدی
-0.61
POSITIVE LOGITS
further
0.80
further
0.75
FURTHER
0.74
Further
0.69
Further
0.68
FURTHER
0.66
进一步
0.60
deeper
0.58
deepening
0.56
deepen
0.55
Activations Density 0.182%