INDEX
Explanations
sentences related to performance evaluations
conclusive statements or observations
NEGATIVE LOGITS
clandestine     -0.84
hijacked        -0.83
unsuspecting    -0.81
authorized      -0.76
authorised      -0.75
agen            -0.75
ricanes         -0.75
malfunction     -0.74
tricked         -0.74
innocuous       -0.73
POSITIVE LOGITS
Lastly          1.51
Additionally    1.45
Also            1.43
Highly          1.43
Plus            1.42
Definitely      1.40
Furthermore     1.40
Overall         1.38
Moreover        1.37
Especially      1.37
Activations Density: 0.396%