Assembler in c#

Clive Tooth · Mar 17, 2012

Here is some code that I would like to speed up...

=================================
unchecked
{
    for (int i = startI; i < stopI; i++)
    {
        uint m = (a[i] << 8) | carry;
        carry = m/myBase;
        if ((a[i] = m%myBase) == 0)
        {
            WorkerFoundZero(myNumber, i);
        }
    } // i loop
} // unchecked
=================================

a[] is a uint array; carry and myBase are uints. It seems that I need
to do *two* divide instructions, "m/myBase" and "m%myBase". Of course,
the processor has a DIV instruction that returns both the quotient and
the remainder. How can I get access to both results? Math.DivRem does
not seem particularly fast, and it only accepts int and long operands.

Ideally, I would like to be able to write a couple of lines of
assembler in my C# program.
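
For reference, the Math.DivRem route mentioned in the question might look like the sketch below. It assumes the uint operands are widened to long so that the Math.DivRem(long, long, out long) overload applies; the class name DivRemSketch and the helper DivRem are hypothetical and not from the thread. The widening and narrowing casts are extra work, which is one plausible reason this route is not faster than plain operators.

```csharp
using System;

static class DivRemSketch
{
    // Hypothetical helper: returns the quotient and writes the remainder.
    // The uint operands are widened to long because the overload used here,
    // Math.DivRem(long, long, out long), does not take uint arguments.
    public static uint DivRem(uint m, uint myBase, out uint remainder)
    {
        long rem;
        long quotient = Math.DivRem((long)m, (long)myBase, out rem);
        remainder = (uint)rem;
        return (uint)quotient;
    }

    static void Main()
    {
        // Sample value shaped like the loop's m: (limb << 8) | carry.
        uint m = (1234567u << 8) | 3u;
        uint rem;
        uint quot = DivRem(m, 10000000u, out rem);
        Console.WriteLine("quotient = " + quot + ", remainder = " + rem);
    }
}
```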

Clive Tooth · Mar 18, 2012

C# is good at a lot of different things, but inline assembly isn't one of
them.

That said, you have already done the division. Division is way more
expensive than addition or multiplication. So why not take advantage of
the result you already have:

    carry = m / myBase;
    uint remainder = m - carry * myBase;
    if ((a[i] = remainder) == 0)
    {
    ...

?

I didn't write a benchmark to actually compare, but it's possible that
would be significantly faster than the two divisions.

Of course, that assumes that the JIT compiler is not already optimizing
your two operations into a single DIV instruction. Unless you've looked at
the generated machine code, you should not assume it isn't.
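
Since no benchmark was written in the thread, here is a rough sketch of one way to compare the two variants. It is only a sketch under stated assumptions: the class and method names, the array size, the pass count, and the random seed are all hypothetical, the WorkerFoundZero call is left out, and myBase is passed as a parameter so the JIT cannot treat the divisor as a compile-time constant. Timings will vary with the runtime and hardware, and should be taken from a release build.

```csharp
using System;
using System.Diagnostics;

static class RemainderBenchmark
{
    // Variant from the original post: one division plus one modulo per limb.
    public static uint TwoDivisions(uint[] a, uint myBase, uint carry)
    {
        unchecked
        {
            for (int i = 0; i < a.Length; i++)
            {
                uint m = (a[i] << 8) | carry;
                carry = m / myBase;
                a[i] = m % myBase;
            }
        }
        return carry;
    }

    // Suggested variant: one division, remainder via multiply and subtract.
    public static uint DivideThenMultiply(uint[] a, uint myBase, uint carry)
    {
        unchecked
        {
            for (int i = 0; i < a.Length; i++)
            {
                uint m = (a[i] << 8) | carry;
                carry = m / myBase;
                a[i] = m - carry * myBase;
            }
        }
        return carry;
    }

    static void Main()
    {
        const uint myBase = 10000000; // base ten million, as in the thread
        const int limbs = 1000000;    // arbitrary size chosen for this sketch
        const int passes = 20;        // arbitrary repeat count

        var rng = new Random(12345);
        uint[] x = new uint[limbs];
        for (int i = 0; i < limbs; i++) x[i] = (uint)rng.Next((int)myBase);
        uint[] y = (uint[])x.Clone();

        // The final carry is discarded: this is a timing harness only.
        Stopwatch sw = Stopwatch.StartNew();
        for (int p = 0; p < passes; p++) TwoDivisions(x, myBase, 0);
        sw.Stop();
        Console.WriteLine("div and mod:      {0} ms", sw.ElapsedMilliseconds);

        sw.Restart();
        for (int p = 0; p < passes; p++) DivideThenMultiply(y, myBase, 0);
        sw.Stop();
        Console.WriteLine("div and multiply: {0} ms", sw.ElapsedMilliseconds);
    }
}
```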

Wow! That change virtually doubled the speed of the loop. Thanks,
Pete.

Background: This is my runs-of-zeros-in-a-power-of-two program.
I am searching for a run of 18 consecutive zeros. I had changed the
program so that it would detect if the current power of two has a run
of at least 13 consecutive zeros. [I changed the internal
representation from base one billion to base ten million.] If such a
run is present the program simply proceeds to the next power of two by
doubling the current one. If such a run is not present the program
multiplies the current power by 256. The effect of that change was to
make the program about 2.5 times faster. [Reducing the picoseconds per
digit metric to about 36.] For powers of two in the region of 2^(10^9)
it turns out that the program can multiply by 256 about 99.99% of the
time. Your change has further reduced the ps/digit figure to about 19.
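
The multiply-by-256 step described above can be sketched as follows. This is an illustration under assumptions, not the program from the thread: it assumes the limbs are stored least significant first in base ten million, uses a List<uint> that grows when a final carry remains, and counts zero limbs instead of calling WorkerFoundZero. The names PowerOfTwoSketch, MultiplyBy256 and ToBigInteger are hypothetical. Because each limb is below ten million, the carry is always below 256, so OR-ing it into the shifted limb is the same as adding it. A zero limb means seven aligned zero digits, and any run of 13 zeros must cover at least one whole limb, which is presumably why a zero limb is the trigger for a closer look.

```csharp
using System;
using System.Collections.Generic;
using System.Numerics;

static class PowerOfTwoSketch
{
    const uint MyBase = 10000000; // base ten million: seven decimal digits per limb

    // Multiplies the number by 256 in place and returns how many limbs
    // came out as zero. Limbs are assumed least significant first.
    public static int MultiplyBy256(List<uint> limbs)
    {
        uint carry = 0;
        int zeroLimbs = 0;
        for (int i = 0; i < limbs.Count; i++)
        {
            uint m = (limbs[i] << 8) | carry;    // carry < 256, so | acts as +
            carry = m / MyBase;
            uint remainder = m - carry * MyBase; // single division, as suggested
            limbs[i] = remainder;
            if (remainder == 0) zeroLimbs++;
        }
        if (carry != 0) limbs.Add(carry);        // grow the number when needed
        return zeroLimbs;
    }

    // Converts the limb list to a BigInteger so results can be cross-checked.
    public static BigInteger ToBigInteger(List<uint> limbs)
    {
        BigInteger value = BigInteger.Zero;
        for (int i = limbs.Count - 1; i >= 0; i--)
        {
            value = value * MyBase + limbs[i];
        }
        return value;
    }

    static void Main()
    {
        var limbs = new List<uint> { 1 };
        for (int k = 1; k <= 100; k++)
        {
            MultiplyBy256(limbs);
            if (ToBigInteger(limbs) != BigInteger.Pow(2, 8 * k))
            {
                throw new Exception("mismatch at 2^" + (8 * k));
            }
        }
        Console.WriteLine("2^800 checked, stored in " + limbs.Count + " limbs");
    }
}
```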
