INDEX
Explanations
timestamps in text
time and date indicators, as well as terms related to rarity
New Auto-Interp
Negative Logits
istically
-0.67
ffield
-0.67
ignment
-0.67
itiz
-0.66
spons
-0.64
buffs
-0.61
preservation
-0.61
fts
-0.60
gettable
-0.60
isable
-0.60
POSITIVE LOGITS
<|endoftext|>
0.76
âĢº
0.73
MENTS
0.72
||
0.72
NOTE
0.69
ENN
0.68
Thanks
0.68
Posted
0.67
DEFENSE
0.67
Edited
0.67
Activations Density 0.018%