INDEX
Explanations
mentions of external entities or sources
references to external sources or entities
New Auto-Interp
Negative Logits
killer
-0.89
birds
-0.81
Maker
-0.79
EY
-0.78
ony
-0.77
YR
-0.76
SHIP
-0.73
Maker
-0.71
oned
-0.71
KING
-0.70
POSITIVE LOGITS
ities
1.10
ized
1.00
izing
0.91
ization
0.90
combustion
0.90
izes
0.89
ised
0.85
izable
0.83
affairs
0.81
izations
0.80
Activations Density 0.017%