INDEX
Explanations
mentions of things being in specific forms or formats
specific phrases or concepts indicated by the word "form."
New Auto-Interp
Negative Logits
avorite
-0.72
sear
-0.69
Ban
-0.67
Thro
-0.65
dain
-0.65
EStreamFrame
-0.64
Duty
-0.63
Motorsport
-0.63
Rare
-0.63
Hots
-0.62
POSITIVE LOGITS
aldehyde
1.32
ative
1.04
ulating
0.95
fitting
0.93
ulator
0.86
atives
0.84
idable
0.81
ulates
0.81
acion
0.81
ÑĮ
0.78
Activations Density 0.015%