SuccessChanges

Summary

  1. [SPARK-23551][BUILD] Exclude `hadoop-mapreduce-client-core` dependency (commit: 2aa66eb387d20725bbd7551d2d77609a77b1e699) (details)
  2. [SPARK-22883][ML][TEST] Streaming tests for spark.ml.feature, from A to (commit: 56cfbd932d3d038ce21cfa4939dfd9563c719003) (details)
  3. [SPARKR][DOC] fix link in vignettes (commit: 8fe20e15196b4ddbd80828ad3a91cf06c5dbea84) (details)
  4. [SPARK-23570][SQL] Add Spark 2.3.0 in HiveExternalCatalogVersionsSuite (commit: f12fa13f16daf0a3f194d78a7e028c8aa7522676) (details)
Commit 2aa66eb387d20725bbd7551d2d77609a77b1e699 by vanzin
[SPARK-23551][BUILD] Exclude `hadoop-mapreduce-client-core` dependency
from `orc-mapreduce`
## What changes were proposed in this pull request?
This PR aims to prevent `orc-mapreduce` dependency from making IDEs and
maven confused.
**BEFORE** Please note that `2.6.4` at `Spark Project SQL`.
```
$ mvn dependency:tree -Phadoop-2.7
-Dincludes=org.apache.hadoop:hadoop-mapreduce-client-core
...
[INFO]
------------------------------------------------------------------------
[INFO] Building Spark Project Catalyst 2.4.0-SNAPSHOT
[INFO]
------------------------------------------------------------------------
[INFO]
[INFO] --- maven-dependency-plugin:3.0.2:tree (default-cli)
spark-catalyst_2.11 ---
[INFO] org.apache.spark:spark-catalyst_2.11:jar:2.4.0-SNAPSHOT
[INFO] \- org.apache.spark:spark-core_2.11:jar:2.4.0-SNAPSHOT:compile
[INFO]    \- org.apache.hadoop:hadoop-client:jar:2.7.3:compile
[INFO]       \-
org.apache.hadoop:hadoop-mapreduce-client-core:jar:2.7.3:compile
[INFO]
[INFO]
------------------------------------------------------------------------
[INFO] Building Spark Project SQL 2.4.0-SNAPSHOT
[INFO]
------------------------------------------------------------------------
[INFO]
[INFO] --- maven-dependency-plugin:3.0.2:tree (default-cli)
spark-sql_2.11 ---
[INFO] org.apache.spark:spark-sql_2.11:jar:2.4.0-SNAPSHOT
[INFO] \- org.apache.orc:orc-mapreduce:jar:nohive:1.4.3:compile
[INFO]    \-
org.apache.hadoop:hadoop-mapreduce-client-core:jar:2.6.4:compile
```
**AFTER**
```
$ mvn dependency:tree -Phadoop-2.7
-Dincludes=org.apache.hadoop:hadoop-mapreduce-client-core
...
[INFO]
------------------------------------------------------------------------
[INFO] Building Spark Project Catalyst 2.4.0-SNAPSHOT
[INFO]
------------------------------------------------------------------------
[INFO]
[INFO] --- maven-dependency-plugin:3.0.2:tree (default-cli)
spark-catalyst_2.11 ---
[INFO] org.apache.spark:spark-catalyst_2.11:jar:2.4.0-SNAPSHOT
[INFO] \- org.apache.spark:spark-core_2.11:jar:2.4.0-SNAPSHOT:compile
[INFO]    \- org.apache.hadoop:hadoop-client:jar:2.7.3:compile
[INFO]       \-
org.apache.hadoop:hadoop-mapreduce-client-core:jar:2.7.3:compile
[INFO]
[INFO]
------------------------------------------------------------------------
[INFO] Building Spark Project SQL 2.4.0-SNAPSHOT
[INFO]
------------------------------------------------------------------------
[INFO]
[INFO] --- maven-dependency-plugin:3.0.2:tree (default-cli)
spark-sql_2.11 ---
[INFO] org.apache.spark:spark-sql_2.11:jar:2.4.0-SNAPSHOT
[INFO] \- org.apache.spark:spark-core_2.11:jar:2.4.0-SNAPSHOT:compile
[INFO]    \- org.apache.hadoop:hadoop-client:jar:2.7.3:compile
[INFO]       \-
org.apache.hadoop:hadoop-mapreduce-client-core:jar:2.7.3:compile
```
## How was this patch tested?
1. Pass the Jenkins with `dev/test-dependencies.sh` with the existing
dependencies. 2. Manually do the following and see the change.
``` mvn dependency:tree -Phadoop-2.7
-Dincludes=org.apache.hadoop:hadoop-mapreduce-client-core
```
Author: Dongjoon Hyun <dongjoon@apache.org>
Closes #20704 from dongjoon-hyun/SPARK-23551.
(cherry picked from commit 34811e0b908449fd59bca476604612b1d200778d)
Signed-off-by: Marcelo Vanzin <vanzin@cloudera.com>
(commit: 2aa66eb387d20725bbd7551d2d77609a77b1e699)
The file was modifiedpom.xml (diff)
Commit 56cfbd932d3d038ce21cfa4939dfd9563c719003 by joseph
[SPARK-22883][ML][TEST] Streaming tests for spark.ml.feature, from A to
H
## What changes were proposed in this pull request?
Adds structured streaming tests using testTransformer for these suites:
* BinarizerSuite
* BucketedRandomProjectionLSHSuite
* BucketizerSuite
* ChiSqSelectorSuite
* CountVectorizerSuite
* DCTSuite.scala
* ElementwiseProductSuite
* FeatureHasherSuite
* HashingTFSuite
## How was this patch tested?
It tests itself because it is a bunch of tests!
Author: Joseph K. Bradley <joseph@databricks.com>
Closes #20111 from jkbradley/SPARK-22883-streaming-featureAM.
(cherry picked from commit 119f6a0e4729aa952e811d2047790a32ee90bf69)
Signed-off-by: Joseph K. Bradley <joseph@databricks.com>
(commit: 56cfbd932d3d038ce21cfa4939dfd9563c719003)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/CountVectorizerSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/BucketizerSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/FeatureHasherSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/ElementwiseProductSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/BinarizerSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/DCTSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/HashingTFSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/BucketedRandomProjectionLSHSuite.scala (diff)
The file was modifiedmllib/src/test/scala/org/apache/spark/ml/feature/ChiSqSelectorSuite.scala (diff)
Commit 8fe20e15196b4ddbd80828ad3a91cf06c5dbea84 by felixcheung
[SPARKR][DOC] fix link in vignettes
## What changes were proposed in this pull request?
Fix doc link that was changed in 2.3
shivaram
Author: Felix Cheung <felixcheung_m@hotmail.com>
Closes #20711 from felixcheung/rvigmean.
(cherry picked from commit 0b6ceadeb563205cbd6bd03bc88e608086273b5b)
Signed-off-by: Felix Cheung <felixcheung@apache.org>
(commit: 8fe20e15196b4ddbd80828ad3a91cf06c5dbea84)
The file was modifiedR/pkg/vignettes/sparkr-vignettes.Rmd (diff)
Commit f12fa13f16daf0a3f194d78a7e028c8aa7522676 by gatorsmile
[SPARK-23570][SQL] Add Spark 2.3.0 in HiveExternalCatalogVersionsSuite
## What changes were proposed in this pull request? Add Spark 2.3.0 in
HiveExternalCatalogVersionsSuite since Spark 2.3.0 is released for
ensuring backward compatibility.
## How was this patch tested? N/A
Author: gatorsmile <gatorsmile@gmail.com>
Closes #20720 from gatorsmile/add2.3.
(cherry picked from commit 487377e693af65b2ff3d6b874ca7326c1ff0076c)
Signed-off-by: gatorsmile <gatorsmile@gmail.com>
(commit: f12fa13f16daf0a3f194d78a7e028c8aa7522676)
The file was modifiedsql/hive/src/test/scala/org/apache/spark/sql/hive/HiveExternalCatalogVersionsSuite.scala (diff)