INDEX
Explanations
the word "Lah" or "Lahore"
mentions of a specific name or entity, likely an individual's surname
New Auto-Interp
Negative Logits
xual
-0.74
AGE
-0.72
MENT
-0.68
FOX
-0.68
displayText
-0.66
FACE
-0.65
IBLE
-0.62
PAX
-0.61
pron
-0.61
APE
-0.60
POSITIVE LOGITS
Lah
0.98
awar
0.92
renheit
0.91
itsch
0.83
lah
0.83
acan
0.82
ahah
0.81
IJ
0.80
rang
0.80
hari
0.79
Activations Density 0.007%