INDEX
Explanations
phrases related to consequences or outcomes
phrases indicating causation or the result of an action
New Auto-Interp
Negative Logits
pages
-0.82
andon
-0.66
mods
-0.64
itiveness
-0.64
views
-0.64
existent
-0.64
events
-0.64
files
-0.64
bu
-0.63
sama
-0.62
POSITIVE LOGITS
result
1.77
consequence
1.55
precaution
1.46
prerequisite
1.07
reminder
1.05
testament
1.05
consolation
0.99
workaround
0.97
safeguard
0.97
matter
0.97
Activations Density 0.088%