INDEX
Explanations
statements that introduce additional information or context in the text
Negative Logits
crow       -0.73
throats    -0.69
ocene      -0.68
abies      -0.64
hearts     -0.62
RED        -0.60
fell       -0.58
Balkans    -0.58
ICAN       -0.57
stay       -0.57
Positive Logits
importantly      0.91
entimes          0.83
,.               0.79
.,               0.77
guiActiveUn      0.71
è£ħ              0.71
interestingly    0.70
adays            0.67
,                0.67
ãĥ»              0.66
Activations Density 0.014%