Build #144549

Environment variables

NameValue
ANDROID_HOME/home/android-sdk/
AWS_ACCESS_KEY_ID[*******]
AWS_SECRET_ACCESS_KEY[*******]
BUILD_CAUSEGHPRBCAUSE
BUILD_CAUSE_GHPRBCAUSEtrue
BUILD_DISPLAY_NAME#144549
BUILD_ID144549
BUILD_NUMBER144549
BUILD_TAGjenkins-SparkPullRequestBuilder-144549
BUILD_URLhttps://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144549/
CLASSPATH$CLASSPATH
DBUS_SESSION_BUS_ADDRESSunix:path=/run/user/1001/bus
EXECUTOR_NUMBER2
GITHUB_OAUTH_KEY[*******]
GIT_BRANCHSPARK-37022
GIT_COMMITcdd3d77ebe1a2af0f527e2e9e938026b17d8b175
GIT_PREVIOUS_COMMIT7a32ff4d8af3a2fb212d0b159df6db69f9d71399
GIT_PREVIOUS_SUCCESSFUL_COMMIT7a32ff4d8af3a2fb212d0b159df6db69f9d71399
GIT_URLhttps://github.com/apache/spark.git
HOME/home/jenkins
HUDSON_HOME/var/lib/jenkins
HUDSON_SERVER_COOKIE472906e9832aeb79
HUDSON_URLhttps://amplab.cs.berkeley.edu/jenkins/
JAVA_HOME/usr/java/latest
JENKINS_HOME/var/lib/jenkins
JENKINS_SERVER_COOKIE472906e9832aeb79
JENKINS_URLhttps://amplab.cs.berkeley.edu/jenkins/
JOB_BASE_NAMESparkPullRequestBuilder
JOB_NAMESparkPullRequestBuilder
JOB_URLhttps://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/
LANGen_US.UTF-8
LOGNAMEjenkins
MOTD_SHOWNpam
NODE_LABELSresearch-jenkins-worker-05 ubuntu ubuntu20 worker-05
NODE_NAMEresearch-jenkins-worker-05
OLDPWD/home/jenkins
PATH/home/jenkins/tools/hudson.tasks.Maven_MavenInstallation/Maven_3.6.3/bin/:/home/jenkins/gems/bin:/usr/local/go/bin:/home/jenkins/go-projects/bin:/home/jenkins/anaconda2/bin:/home/jenkins/tools/hudson.tasks.Maven_MavenInstallation/Maven_3.6.3/bin/:/home/jenkins/gems/bin:/usr/local/go/bin:/home/jenkins/go-projects/bin:/home/jenkins/anaconda2/bin:$PATH
PWD/home/jenkins
ROOT_BUILD_CAUSEGHPRBCAUSE
ROOT_BUILD_CAUSE_GHPRBCAUSEtrue
RUN_ARTIFACTS_DISPLAY_URLhttps://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144549/display/redirect?page=artifacts
RUN_CHANGES_DISPLAY_URLhttps://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144549/display/redirect?page=changes
RUN_DISPLAY_URLhttps://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144549/display/redirect
RUN_TESTS_DISPLAY_URLhttps://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/144549/display/redirect?page=tests
SHELL/bin/bash
SHLVL0
SPARK_TEST_KEY[*******]
SSH_CLIENT192.168.10.11 55710 22
SSH_CONNECTION192.168.10.11 55710 192.168.10.25 22
USERjenkins
WORKSPACE/home/jenkins/workspace/SparkPullRequestBuilder
XDG_RUNTIME_DIR/run/user/1001
XDG_SESSION_CLASSuser
XDG_SESSION_ID3
XDG_SESSION_TYPEtty
_/usr/java/latest/bin/java
ghprbActualCommit34559c901b93221f2866030248fcd1f3dfa82e0d
ghprbActualCommitAuthorzero323
ghprbActualCommitAuthorEmailmszymkiewicz@gmail.com
ghprbAuthorRepoGitUrlhttps://github.com/zero323/spark.git
ghprbCommentBodynull
ghprbCredentialsIdb7d94526-9e9b-435f-9275-d7dbf209f4a3
ghprbGhRepositoryapache/spark
ghprbPullAuthorEmail
ghprbPullAuthorLoginzero323
ghprbPullAuthorLoginMention@zero323
ghprbPullDescriptionGitHub pull request #34297 of commit 34559c901b93221f2866030248fcd1f3dfa82e0d, no merge conflicts.
ghprbPullId34297
ghprbPullLinkhttps://github.com/apache/spark/pull/34297
ghprbPullLongDescription<!--\r\nThanks for sending a pull request!  Here are some tips for you:\r\n  1. If this is your first time, please read our contributor guidelines: https://spark.apache.org/contributing.html\r\n  2. Ensure you have added or run the appropriate tests for your PR: https://spark.apache.org/developer-tools.html\r\n  3. If the PR is unfinished, add '[WIP]' in your PR title, e.g., '[WIP][SPARK-XXXX] Your PR title ...'.\r\n  4. Be sure to keep the PR description updated to reflect all changes.\r\n  5. Please write your PR title to summarize what this PR proposes.\r\n  6. If possible, provide a concise example to reproduce the issue for a faster review.\r\n  7. If you want to add a new configuration, please read the guideline first for naming configurations in\r\n     'core/src/main/scala/org/apache/spark/internal/config/ConfigEntry.scala'.\r\n  8. If you want to add or modify an error type or message, please read the guideline first in\r\n     'core/src/main/resources/error/README.md'.\r\n-->\r\n\r\n### What changes were proposed in this pull request?\r\n<!--\r\nPlease clarify what changes you are proposing. The purpose of this section is to outline the changes and how this PR fixes the issue. \r\nIf possible, please consider writing useful notes for better and faster reviews in your PR. See the examples below.\r\n  1. If you refactor some codes with changing classes, showing the class hierarchy will help reviewers.\r\n  2. If you fix some SQL features, you can provide some references of other DBMSes.\r\n  3. If there is design documentation, please add the link.\r\n  4. If there is a discussion in the mailing list, please add the link.\r\n-->\r\n\r\nThis PR applies `black` (21.5b2) formatting to the whole `python/pyspark` source tree.\r\n\r\nAdditionally, the following changes were made:\r\n\r\n- Disabled E501 (line too long) in pycodestyle config ‒ black allows line to exceed `line-length` in cases of inline comments.  There are 15 cases like this, all listed below\r\n    ```\r\n    pycodestyle checks failed:\r\n    ./python/pyspark/sql/catalog.py:349:101: E501 line too long (103 > 100 characters)\r\n    ./python/pyspark/sql/session.py:652:101: E501 line too long (106 > 100 characters)\r\n    ./python/pyspark/sql/utils.py:50:101: E501 line too long (108 > 100 characters)\r\n    ./python/pyspark/sql/streaming.py:1063:101: E501 line too long (128 > 100 characters)\r\n    ./python/pyspark/sql/streaming.py:1071:101: E501 line too long (112 > 100 characters)\r\n    ./python/pyspark/sql/streaming.py:1080:101: E501 line too long (124 > 100 characters)\r\n    ./python/pyspark/sql/streaming.py:1259:101: E501 line too long (134 > 100 characters)\r\n    ./python/pyspark/sql/pandas/conversion.py:136:101: E501 line too long (106 > 100 characters)\r\n    ./python/pyspark/ml/param/_shared_params_code_gen.py:111:101: E501 line too long (103 > 100 characters)\r\n    ./python/pyspark/ml/param/_shared_params_code_gen.py:136:101: E501 line too long (105 > 100 characters)\r\n    ./python/pyspark/ml/param/_shared_params_code_gen.py:163:101: E501 line too long (101 > 100 characters)\r\n    ./python/pyspark/ml/param/_shared_params_code_gen.py:233:101: E501 line too long (101 > 100 characters)\r\n    ./python/pyspark/ml/param/_shared_params_code_gen.py:265:101: E501 line too long (101 > 100 characters)\r\n    ./python/pyspark/tests/test_readwrite.py:235:101: E501 line too long (114 > 100 characters)\r\n    ./python/pyspark/tests/test_readwrite.py:336:101: E501 line too long (114 > 100 characters)\r\n    ```\r\n- After reformatting, minor typing changes were made:\r\n   - Realign certain `type: ignore` comments with ignored code.\r\n   -  Apply explicit `casts` to  ` @unittest.skipIf` messages.  The following\r\n      ```python\r\n      @unittest.skipIf(\r\n          not have_pandas or not have_pyarrow, pandas_requirement_message or pyarrow_requirement_message\r\n      )  # type: ignore[arg-type]\r\n      ```\r\n      replaced with\r\n              \r\n      ```python\r\n      @unittest.skipIf(\r\n          not have_pandas or not have_pyarrow, \r\n          cast(str, pandas_requirement_message or pyarrow_requirement_message),\r\n      )\r\n      ```\r\n\r\n\r\n\r\n### Why are the changes needed?\r\n<!--\r\nPlease clarify why the changes are needed. For instance,\r\n  1. If you propose a new API, clarify the use case for a new API.\r\n  2. If you fix a bug, you can clarify why it is a bug.\r\n-->\r\n\r\nConsistency and reduced maintenance overhead.\r\n\r\n### Does this PR introduce _any_ user-facing change?\r\n<!--\r\nNote that it means *any* user-facing change including all aspects such as the documentation fix.\r\nIf yes, please clarify the previous behavior and the change this PR proposes - provide the console output, description and/or an example to show the behavior difference if possible.\r\nIf possible, please also clarify if this is a user-facing change compared to the released Spark versions or within the unreleased branches such as master.\r\nIf no, write 'No'.\r\n-->\r\n\r\nYes.\r\n\r\n### How was this patch tested?\r\n<!--\r\nIf tests were added, say they were added here. Please make sure to add some test cases that check the changes thoroughly including negative and positive cases if possible.\r\nIf it was tested in a way different from regular unit tests, please clarify how you tested step by step, ideally copy and paste-able, so that other reviewers can test and check, and descendants can verify in the future.\r\nIf tests were not added, please describe why they were not added and/or why it was difficult to add.\r\n-->\r\n\r\nExisting liners and tests.\r\n
ghprbPullTitle[WIP][SPARK-37022][PYTHON] Use black as a formatter for  PySpark
ghprbSourceBranchSPARK-37022
ghprbTargetBranchmaster
ghprbTriggerAuthor
ghprbTriggerAuthorEmail
ghprbTriggerAuthorLogin
ghprbTriggerAuthorLoginMention
sha1origin/pr/34297/merge