在flink开发中,如果我们想要去重的话,也会使用到distinct函数,这个distinct函数和sql是一个效果,下面我们用代码演示一下,完整代码示例如下:
package org.example; import org.apache.flink.api.java.DataSet; import org.apache.flink.api.java.ExecutionEnvironment; public class DistinctFunctionJob { public static void main(String[] args) throws Exception{ final ExecutionEnvironment env = ExecutionEnvironment.getExecutionEnvironment(); DataSet<String> dataSource = env.fromElements("张三","李四","王五","赵六","李四","王五","赵六", "张三","李四","王五","李四","王五","赵六","李四" ); dataSource.distinct().print(); } }
运行看看效果:
可以看到被去重了,最后获取到的结果是去重后的结果。
最后按照惯例,附上本案例的源码,登录后即可下载。
还没有评论,来说两句吧...