INDEX
Explanations
phrases related to being isolated or isolating others
terms related to isolation and its effects
Negative Logits
enegger           -0.99
orah              -0.98
mington           -0.76
orthy             -0.69
vous              -0.67
soDeliveryDate    -0.64
deen              -0.64
ickr              -0.64
ibal              -0.64
ruary             -0.63
Positive Logits
isolated          0.87
isolation         0.86
confinement       0.80
ivities           0.74
olated            0.74
ously             0.72
ively             0.70
ism               0.69
isol              0.66
ivity             0.65
Activations Density: 0.038%