Thursday, September 8, 2011

#MAHOUT -796 power iterations doc here

Here it goes, so i don't lose it. Also attached to the issue.


Still, more work to come in MAHOUT-797, although not tremendously a lot...

Thursday, September 1, 2011

#Mahout #MapReduce #SSVD version gets power iterations, fixes

I have been working on MAHOUT-796 this week (current work is in https://github.com/dlyubimov/mahout-commits/tree/MAHOUT-796).


Nathan Halko pointed to me that power iterations in practice improve precision and reduce noise.


After a little discussion, that's the route we went :


 Original formula 

Q=qr[(AA)qAΩ].Q,

B=QA.

Modified version 

Y=AΩ,

B0=[qr(Y).Q]A,

Bi=[qr(ABi1).Q]A,i[1..q].

Notation qr().Q means "compute QR decomposition of the argument and retain Q as a result". 

Current combination of QJob and BtJob is essentially producing B0=Aqr(AΩ).Q. Intermediate QJob results are QR blocks, not a final Q, so QJob is not terribly meaningful without BtJob.

The task boils down to figuring out alternative pipeline modifications necessary to produce Bi. After that, algorithm proceeds as before with assumption of BBq.
 

The existing processing will be equivalent to q=0

Bi pipeline (some new code)

Bi pipeline produces Bi=Aqr(AB).Q.

this is very similar to B0 pipeline with specfics being full multiplication of AB in the first job and first pass qr pushdown to the reducer of the first job:

  • map1 : AB outer products are produced 
  • combiner1 : presumming outer products of AB.
  • reducer1 finalizes summing up outer products of AB and starts qrFirstPass(AB)qr blocks. (partitioning must be done by A block # it seems).
  • mapper2, combiner2, reducer2 proceed exactly as mapper2, combiner2, reducer2 in B pipeline and output final Bi with blocks corresponding to initial splits of A input.

Thus, this pipeline is 2 MR jobs with 2 sorts (map1 + combiner1 + shuffle and sort + reducer1 + map2 + combiner2 + shuffle and sort + reducer2).

Integration of Cholesky trick route for computing power iterations 

Ted Dunning pointed out that the whole B pipeline could avoid QR step.

When power iterations are concened, note that whatever route is chosen to calculate Bi=g(Yi), mathematically it should still be valid i[0..q] for as long as we assume 

Y0=AΩ,i=0;

Yi=ABi1,i>0.
 

So, if Cholesky trick allows to produce B0 efficiently, I see no reason why it could not be applied to producing the Bi.

I will be pursuing MR option of running Cholesky decompositon of YY in MAHOUT-797.
If that holds, then power iterations with Cholesky decomposition will likely be similar. 
LL=YY=RR we know how to compute R;
Bi=(R1)YiA;

Y0=AΩ,i=0;

Yi=ABi1,i>0.


Tests
Tests with local MR QR-based solver seem to be encouraging. There's clear improvement on precision. 
With surrogate random input with predefined singular values 10,4,1,(0.1...), toy input 2000x1000 and k=3,p=10 I seem to be getting improvement after just one additional power iteration:



q=0: 

--SSVD solver singular values:
svs: 9.998472 3.993542 0.990456 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000
q=1: (+2 more MR sequential steps):
--SSVD solver singular values:
svs: 10.000000 4.000000 0.999999 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000
After some tests and cleanup, I guess it is going to become a commit really soon.
Bug fixes
In addition to all this, MAHOUT-796 branch gets a lot of bug fixes (in particular, better handling sparse inputs which seemed to have still been broken in current version).