INDEX
Explanations
occurrences of the name "Josh" in various contexts
New Auto-Interp
Negative Logits
athers
-0.17
θε
-0.16
erts
-0.16
ascar
-0.15
tti
-0.15
iyan
-0.15
stood
-0.15
мена
-0.15
Ibrahim
-0.15
eted
-0.15
POSITIVE LOGITS
ua
0.39
UA
0.23
úa
0.20
uae
0.19
ua
0.19
uat
0.18
embro
0.17
uraa
0.17
rees
0.16
uali
0.16
Activations Density 0.005%