google app engine - How to read/write (String and Array like) variables during Mapreduce iteration? -
i trying process large amounts of data of order 5-10 million.
i running mapper in googleappengine/java task rate of 100/s , bucket-size of 100 billing enabled.
reading , writing datastore during map iteration affects overall speed very large extent. if can read/write string , array variables other simple counters, speed things large extent.
background: trying dedupe large data respect multiple text fields. must run n map jobs , compare rest of data.
Comments
Post a Comment