INDEX
Explanations
conditional statements or questions related to belief or opinion
New Auto-Interp
Negative Logits
onement
-0.71
resa
-0.65
Reply
-0.63
NetMessage
-0.63
udding
-0.63
idency
-0.62
âĸij
-0.61
lets
-0.60
Solution
-0.59
soDeliveryDate
-0.59
POSITIVE LOGITS
technically
1.17
admittedly
0.96
ostensibly
0.86
occasional
0.81
physically
0.81
otherwise
0.81
outward
0.75
occasionally
0.75
slight
0.73
initially
0.73
Activations Density 0.283%