INDEX
Explanations
phrases related to future outcomes or events
references to outcomes and processes
New Auto-Interp
Negative Logits
cknow
-0.71
Haram
-0.69
Presence
-0.67
Invalid
-0.67
jury
-0.67
©¶æ¥µ
-0.65
blank
-0.63
tru
-0.62
tn
-0.61
Unle
-0.61
POSITIVE LOGITS
unfolded
1.08
unfolds
0.99
fared
0.98
pans
0.97
works
0.96
plays
0.94
shakes
0.92
stacks
0.92
fares
0.91
transpired
0.91
Activations Density 0.110%