INDEX
Explanations
mentions of flaws and weaknesses
terms related to flaws and deficiencies
Negative Logits
rollers     -0.76
Ń·          -0.70
Carbuncle   -0.68
iversary    -0.67
ista        -0.66
fleet       -0.66
eston       -0.65
da          -0.64
xon         -0.63
stad        -0.62
Positive Logits
flaws          0.95
plag           0.90
flaw           0.88
weaknesses     0.86
acies          0.85
lessly         0.82
inherent       0.79
ulence         0.77
deficiencies   0.76
Hussein        0.76
Activations Density 0.055%