INDEX
Explanations
modified versions or adaptations of something
references to items or concepts that have been altered or adapted
Negative Logits
çĦ      -0.84
ä       -0.73
HI      -0.70
OPA     -0.69
Water   -0.67
vor     -0.66
riel    -0.66
tale    -0.65
gary    -0.65
True    -0.64
Positive Logits
atile           0.88
xual            0.84
modification    0.83
modifications   0.82
icum            0.79
wrench          0.78
iations         0.78
mitigation      0.78
hap             0.77
organisms       0.75
Activations Density: 0.015%