INDEX
Explanations
references to cabins
references to cabins and related terms
New Auto-Interp
Negative Logits
ku
-0.71
lda
-0.69
avez
-0.67
thodox
-0.65
FW
-0.65
oppable
-0.64
ammad
-0.63
Uz
-0.63
GOODMAN
-0.63
henko
-0.63
POSITIVE LOGITS
cabin
1.22
etry
0.89
Seat
0.84
illo
0.84
Cabin
0.84
door
0.82
artment
0.80
compartment
0.78
igans
0.74
cab
0.72
Activations Density 0.012%