Explanations
terms related to traditional practices or beliefs
references to traditional concepts or practices
Negative Logits
Wanted        -0.73
mentation     -0.72
upon          -0.70
Aware         -0.64
Imran         -0.64
anders        -0.62
atoon         -0.62
Allaah        -0.61
leon          -0.61
completely    -0.60
Positive Logits
arily         0.93
ists          0.93
ised          0.92
ized          0.90
izations      0.87
istic         0.87
ties          0.86
ization       0.81
ist           0.81
wisdom        0.79
Activations Density 0.028%