INDEX
Explanations
phrases indicating a consequence or outcome
phrases that indicate a result or consequence
New Auto-Interp
Negative Logits
dayName
-0.61
displayText
-0.60
MpServer
-0.59
Cosponsors
-0.59
Packs
-0.58
share
-0.58
visors
-0.58
ItemTracker
-0.58
intend
-0.57
TOUR
-0.56
POSITIVE LOGITS
undone
1.00
overs
0.98
oir
0.81
ipp
0.74
unexplained
0.73
wich
0.73
lem
0.69
untreated
0.69
hift
0.68
mia
0.68
Activations Density 0.039%