INDEX
Explanations
texts describing physical scenes or settings
phrases related to actions or events involving people or objects in various contexts
New Auto-Interp
Negative Logits
etheless
-0.87
Lastly
-0.82
zbollah
-0.73
]).
-0.73
Finally
-0.72
"}
-0.71
EStream
-0.69
"))
-0.69
)))
-0.67
Conclusion
-0.66
POSITIVE LOGITS
predecessor
0.54
oret
0.53
first
0.52
ordinarily
0.52
newborn
0.52
agine
0.51
192
0.50
subdiv
0.49
tallest
0.49
predecessors
0.49
Activations Density 1.771%