Explanations
phrases indicating rarity and classification of items or entities
Negative Logits
verning       -0.77
apo           -0.76
entric        -0.73
akespeare     -0.70
EStreamFrame  -0.69
CLS           -0.65
REP           -0.64
uckland       -0.64
illation      -0.63
ensemble      -0.63
Positive Logits
craft     0.88
hood      0.80
ore       0.80
variants  0.80
items     0.79
ãĥ¬       0.77
metals    0.74
coins     0.74
ness      0.73
coins     0.73
Activations Density: 0.013%