INDEX
Explanations
phrases related to news articles or statements
instances of structured statements or reports
New Auto-Interp
Negative Logits
¥µ
-0.68
ĪĴ
-0.62
kees
-0.61
aturdays
-0.61
endeavor
-0.61
utical
-0.60
Cipher
-0.59
ĸļ
-0.59
ĺħ
-0.58
igslist
-0.58
POSITIVE LOGITS
Britain
0.96
Asked
0.91
However
0.89
Labour
0.87
Shape
0.85
Speaking
0.85
Scotland
0.81
BBC
0.80
SPONSORED
0.79
Writing
0.79
Activations Density 0.577%