INDEX
Explanations
instances of the word "expatriate" or its variations in the text
instances of the substring "exp" within words
New Auto-Interp
Negative Logits
thumbs
-0.66
STEM
-0.66
Crom
-0.66
WORK
-0.65
enegger
-0.64
bugs
-0.64
footed
-0.63
Carnival
-0.62
Reconstruction
-0.62
Hipp
-0.62
POSITIVE LOGITS
ropri
1.37
orters
1.37
anse
1.26
orter
1.24
ulsion
1.22
onents
1.20
uls
1.14
iring
1.11
ository
1.10
atri
1.06
Activations Density 0.040%