INDEX
Explanations
references to users in sentences
mentions of users
Negative Logits
Baptist       -0.68
UNESCO        -0.66
western       -0.65
SourceFile    -0.62
amer          -0.60
Winning       -0.60
Vaugh         -0.60
Maid          -0.59
Lutheran      -0.59
Hurricanes    -0.59
Positive Logits
pace          1.15
hip           1.12
cript         1.01
interface     0.89
interfaces    0.88
hare          0.87
ettings       0.87
interface     0.86
paces         0.85
mens          0.85
Activations Density: 0.031%