INDEX
Explanations
words related to safety, security, and situations that are not going well
phrases indicating safety and well-being
New Auto-Interp
Negative Logits
wrongful
-0.77
plagiar
-0.75
grand
-0.73
Insp
-0.71
Giant
-0.69
interstitial
-0.68
Cove
-0.67
Represent
-0.67
auctions
-0.67
parody
-0.66
POSITIVE LOGITS
calmed
1.21
resumed
1.13
awake
1.03
evacuate
1.02
calm
1.02
evacuated
1.02
alright
0.99
cleared
0.97
stabilized
0.95
ready
0.94
Activations Density 1.038%