INDEX
Explanations
phrases related to criticism and responses to public scrutiny
New Auto-Interp
Negative Logits
ossed
-0.15
daq
-0.15
ichni
-0.14
exels
-0.14
serde
-0.14
ResourceId
-0.14
alborg
-0.14
acho
-0.14
yonel
-0.13
aney
-0.13
POSITIVE LOGITS
criticism
0.62
critics
0.56
Crit
0.55
crit
0.55
Critics
0.53
criticisms
0.51
Crit
0.50
critic
0.47
critique
0.47
critiques
0.46
Activations Density 0.528%