Explanations
names of individuals
proper nouns, especially names associated with people and places
Negative Logits
Token           Logit
breath          -0.72
Redux           -0.62
electronically  -0.62
Constructed     -0.61
rendition       -0.61
centrif         -0.61
Madison         -0.61
idency          -0.61
prevailing      -0.61
speech          -0.61
Positive Logits
Token           Logit
atures          1.19
er              0.97
atical          0.94
owship          0.93
boat            0.93
ichick          0.93
atural          0.92
ular            0.90
atin            0.88
atics           0.88
Activations Density: 0.043%