INDEX
Explanations
instances where an action or event leads to a specific outcome or result
occurrences of the article "a" and its related contexts
New Auto-Interp
Negative Logits
forge
-0.79
itiveness
-0.77
pages
-0.73
\)
-0.66
views
-0.65
conom
-0.64
sama
-0.62
files
-0.62
\.
-0.60
arts
-0.59
POSITIVE LOGITS
result
1.57
precaution
1.35
consequence
1.26
workaround
0.98
gesture
0.94
consolation
0.94
tribute
0.94
reward
0.94
punishment
0.92
reminder
0.91
Activations Density 0.098%