INDEX
Explanations
emphatic expressions like "What a" followed by an adjective or noun
expressions of admiration or emphasis
New Auto-Interp
Negative Logits
:[
-0.64
=-=-=-=-
-0.63
Reports
-0.62
[/
-0.62
ATURES
-0.62
aneously
-0.61
=[
-0.60
pts
-0.60
Lans
-0.59
Ont
-0.59
POSITIVE LOGITS
lot
0.98
heck
0.90
ils
0.88
sexual
0.85
bunch
0.83
person
0.83
historic
0.82
sembly
0.82
mistake
0.80
couple
0.78
Activations Density 0.144%