Explanations
phrases including the word "they're"
references to people or groups and their opinions or actions
Negative Logits
Token           Logit
Windsor         -0.69
Catalyst        -0.67
Starts          -0.65
Tasman          -0.65
Garmin          -0.63
Suzuki          -0.62
CY              -0.62
Photographer    -0.61
Pascal          -0.60
Samar           -0.60
Positive Logits
Token           Logit
were            1.19
selves          1.10
have            1.03
themselves      1.00
are             0.99
selves          0.91
deserve         0.90
tarians         0.87
hip             0.86
ought           0.86
Activations Density: 0.210%