INDEX
Explanations
mentions of a particular word with a common prefix 'Inf' followed by a varying number
occurrences of the term "Inf" in various contexts
New Auto-Interp
Negative Logits
\\\\\\\\
-0.96
âķIJâķIJ
-0.87
manship
-0.86
BOOK
-0.82
ãĤµãĥ¼ãĥĨãĤ£ãĥ¯ãĥ³
-0.75
ãģį
-0.74
åij
-0.73
ãģ®éŃĶ
-0.72
å§«
-0.72
çͰ
-0.71
POSITIVE LOGITS
rared
1.15
lamm
1.14
iltr
1.13
ertility
1.08
ractions
0.99
ocom
0.99
inia
0.95
ortun
0.95
requently
0.95
inite
0.95
Activations Density 0.009%