Here it goes, so i don't lose it. Also attached to the issue.
Still, more work to come in MAHOUT-797, although not tremendously a lot...
Thursday, September 8, 2011
Thursday, September 1, 2011
#Mahout #MapReduce #SSVD version gets power iterations, fixes
I have been working on MAHOUT-796 this week (current work is in https://github.com/dlyubimov/mahout-commits/tree/MAHOUT-796).
Nathan Halko pointed to me that power iterations in practice improve precision and reduce noise.
After a little discussion, that's the route we went :
Q=qr[(AA⊤)qAΩ].Q,
B=Q⊤A.
Y=AΩ,
B0=[qr(Y).Q]⊤A,
Bi=[qr(AB⊤i−1).Q]⊤A,i∈[1..q].
The task boils down to figuring out alternative pipeline modifications necessary to produce B⊤i. After that, algorithm proceeds as before with assumption of B≡Bq.
Bi pipeline produces B⊤i=A⊤qr(AB⊤).Q.
Y0=AΩ,i=0;
Yi=AB⊤i−1,i>0.
Y0=AΩ,i=0;
Yi=AB⊤i−1,i>0.
Bug fixes
Nathan Halko pointed to me that power iterations in practice improve precision and reduce noise.
After a little discussion, that's the route we went :
Original formula
Q=qr[(AA⊤)qAΩ].Q,
B=Q⊤A.
Modified version
Y=AΩ,
B0=[qr(Y).Q]⊤A,
Bi=[qr(AB⊤i−1).Q]⊤A,i∈[1..q].
Notation qr(⋅).Q means "compute QR decomposition of the argument and retain Q as a result".
Current combination of QJob and BtJob is essentially producing B⊤0=A⊤qr(AΩ).Q. Intermediate QJob results are QR blocks, not a final Q, so QJob is not terribly meaningful without BtJob.
The task boils down to figuring out alternative pipeline modifications necessary to produce B⊤i. After that, algorithm proceeds as before with assumption of B≡Bq.
The existing processing will be equivalent to q=0.
Bi pipeline (some new code)
Bi pipeline produces B⊤i=A⊤qr(AB⊤).Q.
this is very similar to B0 pipeline with specfics being full multiplication of AB⊤ in the first job and first pass qr pushdown to the reducer of the first job:
- map1 : AB⊤ outer products are produced
- combiner1 : presumming outer products of AB⊤.
- reducer1 finalizes summing up outer products of AB⊤ and starts qrFirstPass(AB⊤)→qr blocks. (partitioning must be done by A block # it seems).
- mapper2, combiner2, reducer2 proceed exactly as mapper2, combiner2, reducer2 in B pipeline and output final B⊤i with blocks corresponding to initial splits of A input.
Thus, this pipeline is 2 MR jobs with 2 sorts (map1 + combiner1 + shuffle and sort + reducer1 + map2 + combiner2 + shuffle and sort + reducer2).
Integration of Cholesky trick route for computing power iterations
Ted Dunning pointed out that the whole B pipeline could avoid QR step.
When power iterations are concened, note that whatever route is chosen to calculate Bi=g(Yi), mathematically it should still be valid ∀i∈[0..q] for as long as we assume
Y0=AΩ,i=0;
Yi=AB⊤i−1,i>0.
So, if Cholesky trick allows to produce B0 efficiently, I see no reason why it could not be applied to producing the Bi.
I will be pursuing MR option of running Cholesky decompositon of Y⊤Y in MAHOUT-797.
If that holds, then power iterations with Cholesky decomposition will likely be similar.
LL⊤=Y⊤Y=R⊤R⇒ we know how to compute R;
Bi=(R−1)⊤Yi⊤A;
Y0=AΩ,i=0;
Yi=AB⊤i−1,i>0.
Tests
Tests with local MR QR-based solver seem to be encouraging. There's clear improvement on precision.
With surrogate random input with predefined singular values 10,4,1,(0.1...), toy input 2000x1000 and k=3,p=10 I seem to be getting improvement after just one additional power iteration:
q=0:
--SSVD solver singular values:
svs: 9.998472 3.993542 0.990456 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000
svs: 9.998472 3.993542 0.990456 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000
q=1: (+2 more MR sequential steps):
--SSVD solver singular values:
svs: 10.000000 4.000000 0.999999 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000
--SSVD solver singular values:
svs: 10.000000 4.000000 0.999999 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000 0.100000
After some tests and cleanup, I guess it is going to become a commit really soon.
In addition to all this, MAHOUT-796 branch gets a lot of bug fixes (in particular, better handling sparse inputs which seemed to have still been broken in current version).
Subscribe to:
Posts (Atom)