INDEX
Explanations
verbs related to actions or events in the past that involve some form of communication or decision-making
end-of-text markers
New Auto-Interp
Negative Logits
adra
-0.76
enegger
-0.73
atever
-0.71
stood
-0.68
sit
-0.68
neath
-0.64
acebook
-0.63
farious
-0.62
omever
-0.61
ankind
-0.60
POSITIVE LOGITS
own
0.65
aback
0.61
]
0.61
monton
0.59
>>
0.58
=====
0.57
><
0.56
âĦ¢:
0.55
)]
0.55
ream
0.54
Activations Density 0.155%