INDEX
Explanations
phrases related to news and events
phrases that reference specific details or attributes of subjects
New Auto-Interp
Negative Logits
Layer
-0.71
thood
-0.70
ĸļ
-0.70
verbs
-0.67
Ships
-0.64
shell
-0.63
countered
-0.63
linux
-0.63
lite
-0.63
Ľ
-0.63
POSITIVE LOGITS
incident
1.16
discrepancy
1.12
ordeal
1.10
latest
1.10
situation
1.09
announcement
1.06
outcome
0.99
findings
0.98
plight
0.95
sudden
0.94
Activations Density 0.612%