INDEX
Explanations
proper nouns, specifically names of individuals
references to specific people or proper names
New Auto-Interp
Negative Logits
é¾įå¥ij士
-0.72
glers
-0.65
cannabin
-0.64
indebted
-0.59
pests
-0.59
limitation
-0.59
illeg
-0.59
ccording
-0.58
subdiv
-0.57
breadth
-0.56
POSITIVE LOGITS
lees
0.94
andise
0.89
reau
0.83
eus
0.83
uries
0.82
owship
0.81
Garland
0.81
ufact
0.76
alli
0.76
affer
0.74
Activations Density 0.087%