INDEX
Explanations
phrases related to returning people or sending them home
New Auto-Interp
Negative Logits
eyebrow
-0.78
predictive
-0.68
streak
-0.66
gauge
-0.64
ĸļ
-0.64
Gamble
-0.63
Spotlight
-0.61
ggles
-0.61
brainstorm
-0.60
watt
-0.60
POSITIVE LOGITS
safely
1.17
captivity
1.11
peacefully
1.10
alive
1.09
unlawfully
0.96
intact
0.92
captives
0.87
permanently
0.87
ylum
0.86
voluntarily
0.85
Activations Density 0.214%