INDEX
Explanations
mentions of places, particularly emphasizing the word "Ang"
instances of the term "Ang" in various contexts
Negative Logits
Token              Logit
externalToEVAOnly  -0.80
ÄŁ                 -0.76
ãĥ¼ãĥ³             -0.74
cloth              -0.71
ords               -0.63
grades             -0.63
ño                 -0.63
CTR                -0.62
ngth               -0.61
compensated        -0.60
Positive Logits
Token   Logit
rily    1.24
uish    1.14
lia     1.08
lers    0.96
sty     0.92
emouth  0.92
ular    0.86
regor   0.86
strom   0.83
los     0.82
Activations Density: 0.025%
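
The negative-logit tokens `ÄŁ` and `ãĥ¼ãĥ³` look like raw byte-level BPE vocabulary strings rather than readable text. The page does not say which tokenizer the model uses, so the following is only a sketch under the assumption that it is a GPT-2 style byte-level BPE vocabulary. The helper names `bytes_to_unicode` and `decode_token` are illustrative, not part of the dashboard.

```python
def bytes_to_unicode():
    """Byte -> printable-character table used by GPT-2 style byte-level BPE."""
    # Bytes that are already printable map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted to a code point at 256 and above.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Turn a raw vocabulary string back into the text it encodes.

    Assumes every character of `token` comes from the table above;
    invalid UTF-8 sequences are shown as the replacement character.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["ÄŁ", "ãĥ¼ãĥ³", "cloth"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Under that assumption, `ÄŁ` decodes to `ğ` and `ãĥ¼ãĥ³` to `ーン`, while plain ASCII tokens such as `cloth` are unchanged. Tokens that the page already shows as readable text (for example `ño`) should not be passed through this function.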