INDEX
Explanations
phrases or segments related to news articles or reports
phrases that include the word "which" to identify descriptions or explanations
Negative Logits
Token      Logit
Behind     -0.73
cup        -0.73
athi       -0.72
uta        -0.69
rollers    -0.67
Solution   -0.67
ben        -0.65
die        -0.64
angu       -0.63
nor        -0.60
Positive Logits
Token        Logit
culmin       0.95
comprises    0.93
lasted       0.92
consisted    0.88
prompted     0.88
culminated   0.88
consists     0.87
amounted     0.85
resulted     0.84
originated   0.82
Activations Density 0.104%
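
As a rough illustration of how a density figure like the one above could be computed, here is a minimal sketch. It assumes that "density" means the share of sampled token positions on which the feature's activation is above zero; the source does not define the term, so this reading is an assumption. The function name, the threshold parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the percentage of token positions where the feature fires.

    Assumption: a position counts as active when its activation is
    strictly greater than `threshold` (hypothetical parameter).
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return 100.0 * active / len(activations)


# Hypothetical data: 1,000 token positions, the feature fires on exactly one.
example = [0.0] * 999 + [2.5]
print(f"{activation_density(example):.3f}%")  # prints 0.100%
```

Under this reading, a density of 0.104% would correspond to the feature firing on roughly one token in a thousand.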