INDEX
Explanations
phrases associated with critiques or criticisms
mentions of critics and their evaluations
New Auto-Interp
Negative Logits
Alert
-0.69
rex
-0.68
âĸĪ
-0.65
full
-0.63
shi
-0.61
nm
-0.61
Kis
-0.59
cycle
-0.59
Volunte
-0.59
rax
-0.58
POSITIVE LOGITS
critics
3.79
detractors
2.96
Critics
2.58
Critics
2.42
skeptics
2.13
critic
2.12
opponents
2.03
reviewers
1.97
criticisms
1.89
proponents
1.87
Activations Density 0.013%