INDEX
Explanations
references to books and reading
New Auto-Interp
Negative Logits
tonode
-0.47
Books
-0.47
books
-0.44
Books
-0.42
livres
-0.41
BOOKS
-0.40
BOOKS
-0.40
Vijay
-0.40
书
-0.40
complexContent
-0.38
POSITIVE LOGITS
worm
0.80
worms
0.73
ish
0.68
EDEFAULT
0.68
ends
0.68
club
0.64
keeper
0.64
helves
0.64
club
0.63
ended
0.63
Activations Density 0.135%