INDEX
Explanations
phrases or words that draw attention to specific information or issues
the word "highlight" and its variations indicating emphasis or focus on particular subjects
New Auto-Interp
Negative Logits
itte
-0.71
shake
-0.71
ja
-0.69
hell
-0.69
mia
-0.68
thood
-0.67
soever
-0.64
cill
-0.63
onew
-0.63
onz
-0.62
POSITIVE LOGITS
weaknesses
0.93
shortcomings
0.83
similarities
0.81
contradictions
0.80
flaws
0.79
inconsistencies
0.78
highlights
0.77
differences
0.76
milestones
0.75
vulnerabilities
0.75
Activations Density 0.063%