Explanations
mentions of the term "Full"
Negative Logits
Token        Logit
affles       -0.70
mone         -0.69
abbit        -0.66
Tycoon       -0.66
akov         -0.65
pher         -0.65
Disciple     -0.64
whis         -0.64
ovan         -0.62
Downloadha   -0.61
Positive Logits
Token        Logit
erton        1.12
screen       0.98
blown        0.96
fled         0.96
complement   0.94
fledged      0.90
blown        0.89
frontal      0.88
disclosure   0.84
ness         0.84
Activations Density: 1.093%