INDEX
Explanations
descriptions of physical characteristics or actions associated with a particular person
descriptions or references to specific individuals, particularly in a narrative context
New Auto-Interp
Negative Logits
Shape
-0.61
Picture
-0.61
':
-0.55
Belfast
-0.54
Firstly
-0.53
Patreon
-0.52
cryptocurrency
-0.51
lations
-0.51
\":
-0.51
Brexit
-0.51
POSITIVE LOGITS
.")
0.79
)."
0.78
]."
0.67
.'"
0.65
Vaugh
0.62
.).
0.61
}}
0.61
"/>
0.61
").
0.61
'."
0.60
Activations Density 2.973%