netflixcanopyselector.java

来自「一个简单的mapreduce实现」· Java 代码 · 共 55 行

JAVA

55 行

//// Author - Jack Hebert (jhebert@cs.washington.edu)// Copyright 2007// Distributed under GPLv3//import java.io.IOException;import java.util.*;import org.apache.hadoop.mapred.JobConf;import org.apache.hadoop.io.Text;import org.apache.hadoop.io.Writable;import org.apache.hadoop.io.WritableComparable;import org.apache.hadoop.mapred.MapReduceBase;import org.apache.hadoop.mapred.Mapper;import org.apache.hadoop.mapred.OutputCollector;import org.apache.hadoop.mapred.Reporter;public class NetflixCanopySelector extends MapReduceBase implements Mapper {	// We maintain a list of current canopy centers. We then iterate over each point	// presented to this mapper. If it is not within a minimum distance of any current canopy	// center, then it is a new canopy center and is emitted to the reducer. This does no marking	// of points being in canopies, that is a later map-reduce.		private int count = 0;	private ArrayList<NetflixMovie> canopyCenters;		public void configure(JobConf job) {		this.canopyCenters = new ArrayList<NetflixMovie>();	}		// input:  key is movideID, value is <userID,ranking> tuple	// output: movieID of canopy center, movieID: <userID, ranking> 	public void map(WritableComparable key, Writable values,			OutputCollector output, Reporter reporter) throws IOException {		this.count += 1;		String movie_id = ((Text)key).toString();		String data = ((Text)values).toString();		NetflixMovie curr = new NetflixMovie(movie_id, data);		boolean too_close = false;		Text to_emit = new Text(curr.movie_id+":"+data);		for(NetflixMovie nm: this.canopyCenters) {			int matchCount = nm.MatchCount(curr);			if(matchCount > 10)				too_close = true;		}		if(! too_close) {			output.collect(new Text(curr.movie_id), to_emit);			this.canopyCenters.add(curr);			String toShow = this.canopyCenters.size()+":"+this.count;			reporter.setStatus(toShow);		}	}}

netflixcanopyselector.java - 源码说明

本页面展示了「一个简单的mapreduce实现」中的 netflixcanopyselector.java 源码文件，采用 Java 编程语言编写，共 55 行代码。您可以在线阅读完整代码内容，也可以返回资源详情页下载完整源码包进行本地学习和开发。

虫虫开发者社区收录了大量与MapReduce相关的技术资源，包括源代码、技术文档、电路图等，是电子工程师和嵌入式开发者的专业学习平台。

⌨️ 快捷键说明

复制代码Ctrl + C

搜索代码Ctrl + F

全屏模式F11

增大字号Ctrl + =

减小字号Ctrl + -

显示快捷键?