INDEX
Explanations
phrases related to criticism or evaluation, especially focused on pointing out faults or shortcomings
references to advertisements
New Auto-Interp
Negative Logits
Carbuncle
-0.70
FUL
-0.64
Jr
-0.59
Dame
-0.59
Down
-0.58
Strait
-0.57
specially
-0.57
Cake
-0.56
Sturgeon
-0.56
Pik
-0.56
POSITIVE LOGITS
rift
1.35
hoc
1.32
idas
1.28
roit
1.26
ieu
1.25
hesion
1.25
elaide
1.18
missible
1.17
oration
1.15
verbs
1.14
Activations Density 0.015%