azkaban-aplcache

AZNewDispatchingLogic - Create ExecutionController module …

12/3/2018 9:44:53 PM

(#2039)

* AZNewDispatchingLogic_CreateNewModule

* address comment

Jamie Sun

Commit: 8656c00

Tree: 1b20d02

Parents: d2d87ef

Fix bug: queued flows included in active flows (#2043) Bug …

12/3/2018 3:58:15 PM

was introduced in #1833

Juho Autio

Commit: d2d87ef

Tree: 34ad448

Parents: a1b0978

Fix mismatch in name of job log files (#2042) * Fix comments …

11/29/2018 8:17:40 PM

in CommonJobProperties.

* Fix mismatch in name of job log files when trying to kill Hadoop/Spark jobs. Issue: when a user tries to kill a Hadoop/Spark job within an embedded flow the application looks for a log file using the following name pattern "_job.<exec_id>.<job_id>.log" no matter whether the job was embedded or not. But actually if a job is inside an embedded flow the name format used to create its log file is: "job.<exec_id>.<parent_flow>..<job_id>.log". So trying to kill a job inside an embedded flow before would fail because of the log file name mismatch.

* Fix according to review comments

Yeni Bermudez

Commit: a1b0978

Tree: 3b33e8d

Parents: f24ead0

file count sanity check (#2032) follow-up of #2017

11/28/2018 11:08:05 PM

Cheng Ren

Commit: f24ead0

Tree: bdfa2d9

Parents: 35382d6

#1947 wrong 'Host' header in http callbacks deleted. (#1961) Azkaban …

11/27/2018 11:36:05 PM

provide the ability to call HTTP resources as described in 'HTTP Job Callback' https://azkaban.github.io/azkaban/docs/latest/#common-configurations
job.notification.started.1.url=.. and etc

In the implementation https://github.com/azkaban/azkaban/blob/master/azkaban-exec-server/src/main/java/azkaban/execapp/event/JobCallbackManager.java#L263 'azkabanHostName' used as the value of 'Host' header. But the value should be the hostname of requested HTTP resource, not Azkaban's host.

For example for https://hooks.slack.com/services should be Host:hooks.slack.com

Vladislav Sidorovich

Commit: 35382d6

Tree: ce28b5b

Parents: 4f5f813

Remove unused methods in ExecutorManager. (#2036)

11/26/2018 5:02:30 PM

Jamie Sun

Commit: 4f5f813

Tree: f8a2ca2

Parents: 4f2f631

Remove unused method from HadoopSecurityManager (#2030)

11/19/2018 10:39:41 PM

Cheng Ren

Commit: 4f2f631

Tree: 1c5b200

Parents: c5f418b

Minor CSS fixes and JS improvements (#2031)

11/19/2018 4:16:16 PM

Yeni Bermudez

Commit: c5f418b

Tree: 836b910

Parents: 5e0b90b

Improve dispatch request handling of a previously submitted …

11/16/2018 11:27:44 PM

execution (#2023)

* Extract method submitFlowRunner

* Extract method createFlowRunner

* Improve dispatch request handling of a previously submitted execution

- If the execution is indeed running, return OK so that dispatcher knows that it was successfully dispatched
- If the execution was left in some intermediate state, return an error so that dispatcher knows to retry or finalize the execution as failed

Juho Autio

Commit: 5e0b90b

Tree: 61e431d

Parents: bcbb639

Refactor hadoop token fetch logic follow-up (#2028) This …

11/15/2018 11:45:28 PM

PR:
1. adds more logging and standardize existing logging for each token prefetching methods so that we know prefetching from which service is stuck.

2. removes "synchronized" for doPrefetch(HadoopSecurityManager_H_2_0#doPrefetch). Current design makes it hard to debug which token service the job is stuck with fetching token. Since HadoopSecurityManager_H_2_0 is shared by all jobs in the executor, if one job is stuck with fetching token with a problematic token service, all other jobs will be blocked from entering into this synchronized method. It's impossible to infer which token service jobs are stuck with from job logs as they are just waiting for one job to finish fetching token.

Cheng Ren

Commit: bcbb639

Tree: 06cc412

Parents: 106d177