INDEX
Explanations
values related to increasing percentages or numbers
phrases indicating numerical comparisons involving changes or differences over time
New Auto-Interp
Negative Logits
Shift
-0.71
successfully
-0.71
agy
-0.71
é¾įåĸļ士
-0.68
igent
-0.68
leneck
-0.66
blance
-0.65
erto
-0.65
ira
-0.65
resy
-0.64
POSITIVE LOGITS
afar
0.98
whence
0.93
baseline
0.85
scratch
0.71
conception
0.68
1929
0.62
inception
0.60
yesterday
0.60
previous
0.60
Tycoon
0.59
Activations Density 0.066%