INDEX
Explanations
references to a specific person named "Lor"
the word "lor" and its variations
Negative Logits
Token          Logit
Governments    -0.71
BOOK           -0.67
Somali         -0.65
Unch           -0.65
³³³            -0.64
³³             -0.64
Advocate       -0.63
OPLE           -0.63
tails          -0.63
Eliot          -0.62
Positive Logits
Token          Logit
iculture       1.10
ion            0.98
lor            0.97
inated         0.92
ite            0.92
ith            0.90
oso            0.89
icultural      0.89
apy            0.88
ious           0.88
Activations Density: 0.008%