INDEX
Explanations
items or paragraphs that contain clauses or phrases, particularly those involving criticism or societal reflections
New Auto-Interp
Negative Logits
irling
-0.17
dit
-0.15
è¸
-0.15
igham
-0.14
ÙĥÙĬØ©
-0.14
orna
-0.14
atan
-0.14
ducted
-0.14
ulnerable
-0.14
loat
-0.14
POSITIVE LOGITS
.sendStatus
0.14
Pros
0.14
ald
0.14
atel
0.13
abus
0.13
Hass
0.13
Joi
0.13
fort
0.13
.documentation
0.13
ress
0.12
Activations Density 0.099%