INDEX
Explanations
references to artificial substances or technologies, particularly in the context of health and medicine
New Auto-Interp
Negative Logits
Winona
-0.69
فحة
-0.67
Bangor
-0.65
kapa
-0.63
dwelt
-0.61
Weldon
-0.61
شناسی
-0.61
centerline
-0.61
smoked
-0.60
earns
-0.60
POSITIVE LOGITS
revolution
1.29
Revolution
1.20
artificial
1.13
Revolution
1.06
artificial
1.06
Artificial
1.04
revolution
1.00
Artificial
0.96
artificially
0.96
REVOLUTION
0.90
Activations Density 0.092%