INDEX
Explanations
references to toxins and poison-related themes
New Auto-Interp
Negative Logits
كومونز
-0.77
antaranya
-0.77
resave
-0.68
\]
-0.61
PutMapping
-0.61
newBuilder
-0.61
فريبيس
-0.61
ualaikum
-0.60
antaranya
-0.59
cdti
-0.59
POSITIVE LOGITS
poison
1.52
poison
1.42
poisonous
1.34
Poison
1.31
Poison
1.27
toxic
1.24
toxicity
1.22
poisoning
1.20
Toxic
1.19
toxic
1.15
Activations Density 0.489%