INDEX
Explanations
references to lists and instructional formats in writing
New Auto-Interp
Negative Logits
æ©
-0.77
ibu
-0.71
Reincarnated
-0.69
xit
-0.62
ahu
-0.62
[+
-0.61
Demand
-0.61
netflix
-0.61
advertising
-0.59
YP
-0.59
POSITIVE LOGITS
summarize
0.94
caveats
0.94
spoilers
0.90
caveat
0.90
spoiler
0.87
ital
0.87
endix
0.85
summar
0.85
suffice
0.85
disclaimer
0.84
Activations Density 0.355%