Spring Batch 集成

spring-doc.cadn.net.cn

XML Java Both

spring-doc.cadn.net.cn

Spring Batch 集成介绍

Spring Batch 的许多用户可能会遇到以下要求在 Spring Batch 的范围之外，但这可能是有效的，并且使用 Spring Integration 简洁地实现。相反，Spring 集成用户可能会遇到 Spring Batch 需求，需要一种方法有效地集成这两个框架。在这种情况下，几个模式和用例出现，Spring Batch 集成满足这些要求。spring-doc.cadn.net.cn

Spring Batch 和 Spring Integration 之间的界限并不总是很清楚，但有两条建议可以 help：考虑粒度，并应用常见模式。一些这些常见模式在本参考手册中进行了描述部分。spring-doc.cadn.net.cn

将消息传递添加到批处理中可实现运营，以及关键问题的分离和战略制定。例如，一条消息可能会触发要执行的作业，然后消息的发送可以通过多种方式公开。或者，当作业完成或失败，该事件可能会触发要发送的消息，这些消息的使用者可能有作方面的顾虑与应用程序本身无关。消息传递可以也可以嵌入到作业中（例如，读取或写入通过通道处理）。远程分区和远程分块提供在多个 worker 之间分配工作负载的方法。spring-doc.cadn.net.cn

本节涵盖以下关键概念：spring-doc.cadn.net.cn

命名空间支持

从 Spring Batch Integration 1.3 开始，专用的 XML 命名空间添加了支持，目的是提供更简单的配置经验。要激活命名空间，请添加以下内容命名空间声明添加到 Spring XML 应用程序上下文中文件：spring-doc.cadn.net.cn

<beans xmlns="http://www.springframework.org/schema/beans"
  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
  xmlns:batch-int="http://www.springframework.org/schema/batch-integration"
  xsi:schemaLocation="
    http://www.springframework.org/schema/batch-integration
    https://www.springframework.org/schema/batch-integration/spring-batch-integration.xsd">

    ...

</beans>

一个完全配置的 Spring XML Application Context 文件批量集成可能如下所示：spring-doc.cadn.net.cn

<beans xmlns="http://www.springframework.org/schema/beans"
  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
  xmlns:int="http://www.springframework.org/schema/integration"
  xmlns:batch="http://www.springframework.org/schema/batch"
  xmlns:batch-int="http://www.springframework.org/schema/batch-integration"
  xsi:schemaLocation="
    http://www.springframework.org/schema/batch-integration
    https://www.springframework.org/schema/batch-integration/spring-batch-integration.xsd
    http://www.springframework.org/schema/batch
    https://www.springframework.org/schema/batch/spring-batch.xsd
    http://www.springframework.org/schema/beans
    https://www.springframework.org/schema/beans/spring-beans.xsd
    http://www.springframework.org/schema/integration
    https://www.springframework.org/schema/integration/spring-integration.xsd">

    ...

</beans>

将版本号附加到引用的 XSD 文件也是 allowed，但是，作为无版本的声明，始终使用 latest 架构，我们通常不建议附加版本 number 设置为 XSD 名称。添加版本号可能会在更新 Spring Batch 时产生问题集成依赖项，因为它们可能需要更新的版本的 XML 架构。spring-doc.cadn.net.cn

通过消息启动 Batch 作业

使用核心 Spring Batch API 启动批处理作业时，您可以基本上有 2 个选项：spring-doc.cadn.net.cn

在命令行中，使用CommandLineJobRunnerspring-doc.cadn.net.cn
以编程方式，使用JobOperator.start()或JobLauncher.run()spring-doc.cadn.net.cn

例如，您可能希望使用CommandLineJobRunner调用 Batch Jobs 时使用 shell 脚本。或者，您可以使用JobOperator直接（例如，使用 Spring Batch 作为 Web 应用程序的一部分）。但是，那又如何呢更复杂的用例？也许您需要轮询远程（S）FTP server 检索 Batch Job 或应用程序的数据必须同时支持多个不同的数据源。为例如，您不仅可以从 Web 接收数据文件，还可以从 FTP 和其他来源。也许输入文件的其他转换是在调用 Spring Batch 之前需要。spring-doc.cadn.net.cn

因此，执行批处理作业会更强大使用 Spring Integration 及其众多适配器。例如您可以使用 File Inbound Channel Adapter 来监视文件系统中的目录，并以一旦输入文件到达。此外，您还可以创建 Spring 使用多个不同适配器的集成流从多个来源摄取批处理作业的数据同时使用 only configuration。实现所有这些 Spring Integration 的场景很容易，因为它允许解耦的、事件驱动的JobLauncher.spring-doc.cadn.net.cn

Spring Batch 集成提供了JobLaunchingMessageHandler类用于启动批处理作业。的JobLaunchingMessageHandler由 Spring Integration 消息，其有效负载为JobLaunchRequest.这个类是Job需要启动并围绕JobParameters启动 Batch 作业所必需的。spring-doc.cadn.net.cn

下图说明了典型的 Spring 集成消息流以启动 Batch 作业。EIP （Enterprise Integration Patterns）网站提供了消息传送图标及其描述的完整概述。spring-doc.cadn.net.cn

图 1.启动 Batch Job

将文件转换为 JobLaunchRequest

package io.spring.sbi;

import org.springframework.batch.core.Job;
import org.springframework.batch.core.JobParametersBuilder;
import org.springframework.batch.integration.launch.JobLaunchRequest;
import org.springframework.integration.annotation.Transformer;
import org.springframework.messaging.Message;

import java.io.File;

public class FileMessageToJobRequest {
    private Job job;
    private String fileParameterName;

    public void setFileParameterName(String fileParameterName) {
        this.fileParameterName = fileParameterName;
    }

    public void setJob(Job job) {
        this.job = job;
    }

    @Transformer
    public JobLaunchRequest toRequest(Message<File> message) {
        JobParametersBuilder jobParametersBuilder =
            new JobParametersBuilder();

        jobParametersBuilder.addString(fileParameterName,
            message.getPayload().getAbsolutePath());

        return new JobLaunchRequest(job, jobParametersBuilder.toJobParameters());
    }
}



The JobExecution Response

When a batch job is being executed, a
JobExecution instance is returned. This
instance can be used to determine the status of an execution. If
a JobExecution is able to be created
successfully, it is always returned, regardless of whether
or not the actual execution is successful.spring-doc.cadn.net.cn


The exact behavior on how the JobExecution
instance is returned depends on the provided
TaskExecutor. If a
synchronous (single-threaded)
TaskExecutor implementation is used, the
JobExecution response is returned only
after the job completes. When using an
asynchronous
TaskExecutor, the
JobExecution instance is returned
immediately. Users can then take the id of
JobExecution instance
(with JobExecution.getJobId()) and query the
JobRepository for the job’s updated status
using the JobExplorer. For more
information, please refer to the Spring
Batch reference documentation on
Querying the Repository.spring-doc.cadn.net.cn



Spring Batch Integration Configuration

Consider a case where someone needs to create a file inbound-channel-adapter to listen
for CSV files in the provided directory, hand them off to a transformer
(FileMessageToJobRequest), launch the job through the Job Launching Gateway, and then
log the output of the JobExecution with the logging-channel-adapter.spring-doc.cadn.net.cn


The following example shows how that common case can be configured in XML:spring-doc.cadn.net.cn


XML Configuration

<int:channel id="inboundFileChannel"/>
<int:channel id="outboundJobRequestChannel"/>
<int:channel id="jobLaunchReplyChannel"/>

<int-file:inbound-channel-adapter id="filePoller"
    channel="inboundFileChannel"
    directory="file:/tmp/myfiles/"
    filename-pattern="*.csv">
  <int:poller fixed-rate="1000"/>
</int-file:inbound-channel-adapter>

<int:transformer input-channel="inboundFileChannel"
    output-channel="outboundJobRequestChannel">
  <bean class="io.spring.sbi.FileMessageToJobRequest">
    <property name="job" ref="personJob"/>
    <property name="fileParameterName" value="input.file.name"/>
  </bean>
</int:transformer>

<batch-int:job-launching-gateway request-channel="outboundJobRequestChannel"
    reply-channel="jobLaunchReplyChannel"/>

<int:logging-channel-adapter channel="jobLaunchReplyChannel"/>



The following example shows how that common case can be configured in Java:spring-doc.cadn.net.cn


Java Configuration

@Bean
public FileMessageToJobRequest fileMessageToJobRequest() {
    FileMessageToJobRequest fileMessageToJobRequest = new FileMessageToJobRequest();
    fileMessageToJobRequest.setFileParameterName("input.file.name");
    fileMessageToJobRequest.setJob(personJob());
    return fileMessageToJobRequest;
}

@Bean
public JobLaunchingGateway jobLaunchingGateway() {
    SimpleJobLauncher simpleJobLauncher = new SimpleJobLauncher();
    simpleJobLauncher.setJobRepository(jobRepository);
    simpleJobLauncher.setTaskExecutor(new SyncTaskExecutor());
    JobLaunchingGateway jobLaunchingGateway = new JobLaunchingGateway(simpleJobLauncher);

    return jobLaunchingGateway;
}

@Bean
public IntegrationFlow integrationFlow(JobLaunchingGateway jobLaunchingGateway) {
    return IntegrationFlows.from(Files.inboundAdapter(new File("/tmp/myfiles")).
                    filter(new SimplePatternFileListFilter("*.csv")),
            c -> c.poller(Pollers.fixedRate(1000).maxMessagesPerPoll(1))).
            transform(fileMessageToJobRequest()).
            handle(jobLaunchingGateway).
            log(LoggingHandler.Level.WARN, "headers.id + ': ' + payload").
            get();
}





Example ItemReader Configuration

Now that we are polling for files and launching jobs, we need to configure our Spring
Batch ItemReader (for example) to use the files found at the location defined by the job
parameter called "input.file.name", as shown in the following bean configuration:spring-doc.cadn.net.cn


The following XML example shows the necessary bean configuration:spring-doc.cadn.net.cn


XML Configuration

<bean id="itemReader" class="org.springframework.batch.item.file.FlatFileItemReader"
    scope="step">
  <property name="resource" value="file://#{jobParameters['input.file.name']}"/>
    ...
</bean>



The following Java example shows the necessary bean configuration:spring-doc.cadn.net.cn


Java Configuration

@Bean
@StepScope
public ItemReader sampleReader(@Value("#{jobParameters[input.file.name]}") String resource) {
...
    FlatFileItemReader flatFileItemReader = new FlatFileItemReader();
    flatFileItemReader.setResource(new FileSystemResource(resource));
...
    return flatFileItemReader;
}




The main points of interest in the preceding example are injecting the value of
#{jobParameters['input.file.name']}
as the Resource property value and setting the ItemReader bean
to have Step scope. Setting the bean to have Step scope takes advantage of
the late binding support, which allows access to the
jobParameters variable.spring-doc.cadn.net.cn



Available Attributes of the Job-Launching Gateway

The job-launching gateway has the following attributes that you can set to control a job:spring-doc.cadn.net.cn




id: Identifies the underlying Spring bean definition, which is an instance of either:spring-doc.cadn.net.cn



EventDrivenConsumerspring-doc.cadn.net.cn


PollingConsumer
(The exact implementation depends on whether the component’s input channel is a
SubscribableChannel or PollableChannel.)spring-doc.cadn.net.cn





auto-startup: Boolean flag to indicate that the endpoint should start automatically on
startup. The default is true.spring-doc.cadn.net.cn


request-channel: The input MessageChannel of this endpoint.spring-doc.cadn.net.cn


reply-channel: MessageChannel to which the resulting JobExecution payload is sent.spring-doc.cadn.net.cn


reply-timeout: Lets you specify how long (in milliseconds) this gateway waits for the reply message
to be sent successfully to the reply channel before throwing
an exception. This attribute only applies when the channel
might block (for example, when using a bounded queue channel
that is currently full). Also, keep in mind that, when sending to a
DirectChannel, the invocation occurs
in the sender’s thread. Therefore, the failing of the send
operation may be caused by other components further downstream.
The reply-timeout attribute maps to the
sendTimeout property of the underlying
MessagingTemplate instance. If not specified, the attribute
defaults to<emphasis>-1</emphasis>,
meaning that, by default, the Gateway waits indefinitely.spring-doc.cadn.net.cn


job-launcher: Optional. Accepts a
custom
JobLauncher
bean reference.
If not specified the adapter
re-uses the instance that is registered under the id of
jobLauncher. If no default instance
exists, an exception is thrown.spring-doc.cadn.net.cn


order: Specifies the order of invocation when this endpoint is connected as a subscriber
to a SubscribableChannel.spring-doc.cadn.net.cn





Sub-Elements

When this Gateway is receiving messages from a
PollableChannel, you must either provide
a global default Poller or provide a Poller sub-element to the
Job Launching Gateway.spring-doc.cadn.net.cn


The following example shows how to provide a poller in XML:spring-doc.cadn.net.cn


XML Configuration

<batch-int:job-launching-gateway request-channel="queueChannel"
    reply-channel="replyChannel" job-launcher="jobLauncher">
  <int:poller fixed-rate="1000">
</batch-int:job-launching-gateway>



The following example shows how to provide a poller in Java:spring-doc.cadn.net.cn


Java Configuration

@Bean
@ServiceActivator(inputChannel = "queueChannel", poller = @Poller(fixedRate="1000"))
public JobLaunchingGateway sampleJobLaunchingGateway() {
    JobLaunchingGateway jobLaunchingGateway = new JobLaunchingGateway(jobLauncher());
    jobLaunchingGateway.setOutputChannel(replyChannel());
    return jobLaunchingGateway;
}




Providing Feedback with Informational Messages

As Spring Batch jobs can run for long times, providing progress
information is often critical. For example, stake-holders may want
to be notified if some or all parts of a batch job have failed.
Spring Batch provides support for this information being gathered
through:spring-doc.cadn.net.cn




Active pollingspring-doc.cadn.net.cn


Event-driven listenersspring-doc.cadn.net.cn




When starting a Spring Batch job asynchronously (for example, by using the Job Launching
Gateway), a JobExecution instance is returned. Thus, JobExecution.getJobId() can be
used to continuously poll for status updates by retrieving updated instances of the
JobExecution from the JobRepository by using the JobExplorer. However, this is
considered sub-optimal, and an event-driven approach should be preferred.spring-doc.cadn.net.cn


Therefore, Spring Batch provides listeners, including the three most commonly used
listeners:spring-doc.cadn.net.cn




StepListenerspring-doc.cadn.net.cn


ChunkListenerspring-doc.cadn.net.cn


JobExecutionListenerspring-doc.cadn.net.cn




In the example shown in the following image, a Spring Batch job has been configured with a
StepExecutionListener. Thus, Spring Integration receives and processes any step before
or after events. For example, the received StepExecution can be inspected by using a
Router. Based on the results of that inspection, various things can occur (such as
routing a message to a Mail Outbound Channel Adapter), so that an Email notification can
be sent out based on some condition.spring-doc.cadn.net.cn





Figure 2. Handling Informational Messages


The following two-part example shows how a listener is configured to send a
message to a Gateway for a StepExecution events and log its output to a
logging-channel-adapter.spring-doc.cadn.net.cn


First, create the notification integration beans.spring-doc.cadn.net.cn


The following example shows the how to create the notification integration beans in XML:spring-doc.cadn.net.cn


XML Configuration

<int:channel id="stepExecutionsChannel"/>

<int:gateway id="notificationExecutionsListener"
    service-interface="org.springframework.batch.core.StepExecutionListener"
    default-request-channel="stepExecutionsChannel"/>

<int:logging-channel-adapter channel="stepExecutionsChannel"/>



The following example shows the how to create the notification integration beans in Java:spring-doc.cadn.net.cn


Java Configuration

@Bean
@ServiceActivator(inputChannel = "stepExecutionsChannel")
public LoggingHandler loggingHandler() {
    LoggingHandler adapter = new LoggingHandler(LoggingHandler.Level.WARN);
    adapter.setLoggerName("TEST_LOGGER");
    adapter.setLogExpressionString("headers.id + ': ' + payload");
    return adapter;
}

@MessagingGateway(name = "notificationExecutionsListener", defaultRequestChannel = "stepExecutionsChannel")
public interface NotificationExecutionListener extends StepExecutionListener {}










You need to add the @IntegrationComponentScan annotation to your configuration.





Second, modify your job to add a step-level listener.spring-doc.cadn.net.cn


The following example shows the how to add a step-level listener in XML:spring-doc.cadn.net.cn


XML Configuration

<job id="importPayments">
    <step id="step1">
        <tasklet ../>
            <chunk ../>
            <listeners>
                <listener ref="notificationExecutionsListener"/>
            </listeners>
        </tasklet>
        ...
    </step>
</job>



The following example shows the how to add a step-level listener in Java:spring-doc.cadn.net.cn


Java Configuration

public Job importPaymentsJob() {
    return jobBuilderFactory.get("importPayments")
        .start(stepBuilderFactory.get("step1")
                .chunk(200)
                .listener(notificationExecutionsListener())
                ...
}





Asynchronous Processors

Asynchronous Processors help you to scale the processing of items. In the asynchronous
processor use case, an AsyncItemProcessor serves as a dispatcher, executing the logic of
the ItemProcessor for an item on a new thread. Once the item completes, the Future is
passed to the AsynchItemWriter to be written.spring-doc.cadn.net.cn


Therefore, you can increase performance by using asynchronous item processing, basically
letting you implement fork-join scenarios. The AsyncItemWriter gathers the results and
writes back the chunk as soon as all the results become available.spring-doc.cadn.net.cn


The following example shows how to configuration the AsyncItemProcessor in XML:spring-doc.cadn.net.cn


XML Configuration

<bean id="processor"
    class="org.springframework.batch.integration.async.AsyncItemProcessor">
  <property name="delegate">
    <bean class="your.ItemProcessor"/>
  </property>
  <property name="taskExecutor">
    <bean class="org.springframework.core.task.SimpleAsyncTaskExecutor"/>
  </property>
</bean>



The following example shows how to configuration the AsyncItemProcessor in XML:spring-doc.cadn.net.cn


Java Configuration

@Bean
public AsyncItemProcessor processor(ItemProcessor itemProcessor, TaskExecutor taskExecutor) {
    AsyncItemProcessor asyncItemProcessor = new AsyncItemProcessor();
    asyncItemProcessor.setTaskExecutor(taskExecutor);
    asyncItemProcessor.setDelegate(itemProcessor);
    return asyncItemProcessor;
}




The delegate property refers to your ItemProcessor bean, and the taskExecutor
property refers to the TaskExecutor of your choice.spring-doc.cadn.net.cn


The following example shows how to configure the AsyncItemWriter in XML:spring-doc.cadn.net.cn


XML Configuration

<bean id="itemWriter"
    class="org.springframework.batch.integration.async.AsyncItemWriter">
  <property name="delegate">
    <bean id="itemWriter" class="your.ItemWriter"/>
  </property>
</bean>



The following example shows how to configure the AsyncItemWriter in Java:spring-doc.cadn.net.cn


Java Configuration

@Bean
public AsyncItemWriter writer(ItemWriter itemWriter) {
    AsyncItemWriter asyncItemWriter = new AsyncItemWriter();
    asyncItemWriter.setDelegate(itemWriter);
    return asyncItemWriter;
}




Again, the delegate property is
actually a reference to your ItemWriter bean.spring-doc.cadn.net.cn



Externalizing Batch Process Execution

The integration approaches discussed so far suggest use cases
where Spring Integration wraps Spring Batch like an outer-shell.
However, Spring Batch can also use Spring Integration internally.
Using this approach, Spring Batch users can delegate the
processing of items or even chunks to outside processes. This
allows you to offload complex processing. Spring Batch Integration
provides dedicated support for:spring-doc.cadn.net.cn




Remote Chunkingspring-doc.cadn.net.cn


Remote Partitioningspring-doc.cadn.net.cn




Remote Chunking




Figure 3. Remote Chunking


Taking things one step further, one can also externalize the
chunk processing by using the
ChunkMessageChannelItemWriter
(provided by Spring Batch Integration), which sends items out
and collects the result. Once sent, Spring Batch continues the
process of reading and grouping items, without waiting for the results.
Rather, it is the responsibility of the ChunkMessageChannelItemWriter
to gather the results and integrate them back into the Spring Batch process.spring-doc.cadn.net.cn


With Spring Integration, you have full
control over the concurrency of your processes (for instance, by
using a QueueChannel instead of a
DirectChannel). Furthermore, by relying on
Spring Integration’s rich collection of Channel Adapters (such as
JMS and AMQP), you can distribute chunks of a Batch job to
external systems for processing.spring-doc.cadn.net.cn


A job with a step to be remotely chunked might have a configuration similar to the
following in XML:spring-doc.cadn.net.cn


XML Configuration

<job id="personJob">
  <step id="step1">
    <tasklet>
      <chunk reader="itemReader" writer="itemWriter" commit-interval="200"/>
    </tasklet>
    ...
  </step>
</job>



A job with a step to be remotely chunked might have a configuration similar to the
following in Java:spring-doc.cadn.net.cn


Java Configuration

public Job chunkJob() {
     return jobBuilderFactory.get("personJob")
             .start(stepBuilderFactory.get("step1")
                     .<Person, Person>chunk(200)
                     .reader(itemReader())
                     .writer(itemWriter())
                     .build())
             .build();
 }




The ItemReader reference points to the bean you want to use for reading data on the
manager. The ItemWriter reference points to a special ItemWriter (called
ChunkMessageChannelItemWriter), as described above. The processor (if any) is left off
the manager configuration, as it is configured on the worker. You should check any
additional component properties, such  as throttle limits and so on, when implementing
your use case.spring-doc.cadn.net.cn


The following XML configuration provides a basic manager setup:spring-doc.cadn.net.cn


XML Configuration

<bean id="connectionFactory" class="org.apache.activemq.ActiveMQConnectionFactory">
  <property name="brokerURL" value="tcp://localhost:61616"/>
</bean>

<int-jms:outbound-channel-adapter id="jmsRequests" destination-name="requests"/>

<bean id="messagingTemplate"
    class="org.springframework.integration.core.MessagingTemplate">
  <property name="defaultChannel" ref="requests"/>
  <property name="receiveTimeout" value="2000"/>
</bean>

<bean id="itemWriter"
    class="org.springframework.batch.integration.chunk.ChunkMessageChannelItemWriter"
    scope="step">
  <property name="messagingOperations" ref="messagingTemplate"/>
  <property name="replyChannel" ref="replies"/>
</bean>

<int:channel id="replies">
  <int:queue/>
</int:channel>

<int-jms:message-driven-channel-adapter id="jmsReplies"
    destination-name="replies"
    channel="replies"/>



The following Java configuration  provides a basic manager setup:spring-doc.cadn.net.cn


Java Configuration

@Bean
public org.apache.activemq.ActiveMQConnectionFactory connectionFactory() {
    ActiveMQConnectionFactory factory = new ActiveMQConnectionFactory();
    factory.setBrokerURL("tcp://localhost:61616");
    return factory;
}

/*
 * Configure outbound flow (requests going to workers)
 */
@Bean
public DirectChannel requests() {
    return new DirectChannel();
}

@Bean
public IntegrationFlow outboundFlow(ActiveMQConnectionFactory connectionFactory) {
    return IntegrationFlows
            .from(requests())
            .handle(Jms.outboundAdapter(connectionFactory).destination("requests"))
            .get();
}

/*
 * Configure inbound flow (replies coming from workers)
 */
@Bean
public QueueChannel replies() {
    return new QueueChannel();
}

@Bean
public IntegrationFlow inboundFlow(ActiveMQConnectionFactory connectionFactory) {
    return IntegrationFlows
            .from(Jms.messageDrivenChannelAdapter(connectionFactory).destination("replies"))
            .channel(replies())
            .get();
}

/*
 * Configure the ChunkMessageChannelItemWriter
 */
@Bean
public ItemWriter<Integer> itemWriter() {
    MessagingTemplate messagingTemplate = new MessagingTemplate();
    messagingTemplate.setDefaultChannel(requests());
    messagingTemplate.setReceiveTimeout(2000);
    ChunkMessageChannelItemWriter<Integer> chunkMessageChannelItemWriter
            = new ChunkMessageChannelItemWriter<>();
    chunkMessageChannelItemWriter.setMessagingOperations(messagingTemplate);
    chunkMessageChannelItemWriter.setReplyChannel(replies());
    return chunkMessageChannelItemWriter;
}




The preceding configuration provides us with a number of beans. We
configure our messaging middleware using ActiveMQ and the
inbound/outbound JMS adapters provided by Spring Integration. As
shown, our itemWriter bean, which is
referenced by our job step, uses the
ChunkMessageChannelItemWriter for writing chunks over the
configured middleware.spring-doc.cadn.net.cn


Now we can move on to the worker configuration, as shown in the following example:spring-doc.cadn.net.cn


The following example shows the worker configuration in XML:spring-doc.cadn.net.cn


XML Configuration

<bean id="connectionFactory" class="org.apache.activemq.ActiveMQConnectionFactory">
  <property name="brokerURL" value="tcp://localhost:61616"/>
</bean>

<int:channel id="requests"/>
<int:channel id="replies"/>

<int-jms:message-driven-channel-adapter id="incomingRequests"
    destination-name="requests"
    channel="requests"/>

<int-jms:outbound-channel-adapter id="outgoingReplies"
    destination-name="replies"
    channel="replies">
</int-jms:outbound-channel-adapter>

<int:service-activator id="serviceActivator"
    input-channel="requests"
    output-channel="replies"
    ref="chunkProcessorChunkHandler"
    method="handleChunk"/>

<bean id="chunkProcessorChunkHandler"
    class="org.springframework.batch.integration.chunk.ChunkProcessorChunkHandler">
  <property name="chunkProcessor">
    <bean class="org.springframework.batch.core.step.item.SimpleChunkProcessor">
      <property name="itemWriter">
        <bean class="io.spring.sbi.PersonItemWriter"/>
      </property>
      <property name="itemProcessor">
        <bean class="io.spring.sbi.PersonItemProcessor"/>
      </property>
    </bean>
  </property>
</bean>



The following example shows the worker configuration in Java:spring-doc.cadn.net.cn


Java Configuration

@Bean
public org.apache.activemq.ActiveMQConnectionFactory connectionFactory() {
    ActiveMQConnectionFactory factory = new ActiveMQConnectionFactory();
    factory.setBrokerURL("tcp://localhost:61616");
    return factory;
}

/*
 * Configure inbound flow (requests coming from the manager)
 */
@Bean
public DirectChannel requests() {
    return new DirectChannel();
}

@Bean
public IntegrationFlow inboundFlow(ActiveMQConnectionFactory connectionFactory) {
    return IntegrationFlows
            .from(Jms.messageDrivenChannelAdapter(connectionFactory).destination("requests"))
            .channel(requests())
            .get();
}

/*
 * Configure outbound flow (replies going to the manager)
 */
@Bean
public DirectChannel replies() {
    return new DirectChannel();
}

@Bean
public IntegrationFlow outboundFlow(ActiveMQConnectionFactory connectionFactory) {
    return IntegrationFlows
            .from(replies())
            .handle(Jms.outboundAdapter(connectionFactory).destination("replies"))
            .get();
}

/*
 * Configure the ChunkProcessorChunkHandler
 */
@Bean
@ServiceActivator(inputChannel = "requests", outputChannel = "replies")
public ChunkProcessorChunkHandler<Integer> chunkProcessorChunkHandler() {
    ChunkProcessor<Integer> chunkProcessor
            = new SimpleChunkProcessor<>(itemProcessor(), itemWriter());
    ChunkProcessorChunkHandler<Integer> chunkProcessorChunkHandler
            = new ChunkProcessorChunkHandler<>();
    chunkProcessorChunkHandler.setChunkProcessor(chunkProcessor);
    return chunkProcessorChunkHandler;
}




Most of these configuration items should look familiar from the
manager configuration. Workers do not need access to
the Spring Batch JobRepository nor
to the actual job configuration file. The main bean of interest
is the chunkProcessorChunkHandler. The
chunkProcessor property of ChunkProcessorChunkHandler takes a
configured SimpleChunkProcessor, which is where you would provide a reference to your
ItemWriter (and, optionally, your
ItemProcessor) that will run on the worker
when it receives chunks from the manager.spring-doc.cadn.net.cn


For more information, see the section of the "Scalability" chapter on
Remote Chunking.spring-doc.cadn.net.cn


Starting from version 4.1, Spring Batch Integration introduces the @EnableBatchIntegration
annotation that can be used to simplify a remote chunking setup. This annotation provides
two beans that can be autowired in the application context:spring-doc.cadn.net.cn




RemoteChunkingManagerStepBuilderFactory: used to configure the manager stepspring-doc.cadn.net.cn


RemoteChunkingWorkerBuilder: used to configure the remote worker integration flowspring-doc.cadn.net.cn




These APIs take care of configuring a number of components as described in the following diagram:spring-doc.cadn.net.cn





Figure 4. Remote Chunking Configuration


On the manager side, the RemoteChunkingManagerStepBuilderFactory lets you
configure a manager step by declaring:spring-doc.cadn.net.cn




the item reader to read items and send them to workersspring-doc.cadn.net.cn


the output channel ("Outgoing requests") to send requests to workersspring-doc.cadn.net.cn


the input channel ("Incoming replies") to receive replies from workersspring-doc.cadn.net.cn




A ChunkMessageChannelItemWriter and the MessagingTemplate are not needed to be explicitly configured
(Those can still be explicitly configured if required).spring-doc.cadn.net.cn


On the worker side, the RemoteChunkingWorkerBuilder allows you to configure a worker to:spring-doc.cadn.net.cn




listen to requests sent by the manager on the input channel ("Incoming requests")spring-doc.cadn.net.cn


call the handleChunk method of ChunkProcessorChunkHandler for each request
with the configured ItemProcessor and ItemWriterspring-doc.cadn.net.cn


send replies on the output channel ("Outgoing replies") to the managerspring-doc.cadn.net.cn




There is no need to explicitly configure the SimpleChunkProcessor
and the ChunkProcessorChunkHandler (Those can be explicitly configured if required).spring-doc.cadn.net.cn


The following example shows how to use these APIs:spring-doc.cadn.net.cn



@EnableBatchIntegration
@EnableBatchProcessing
public class RemoteChunkingJobConfiguration {

    @Configuration
    public static class ManagerConfiguration {

        @Autowired
        private RemoteChunkingManagerStepBuilderFactory managerStepBuilderFactory;

        @Bean
        public TaskletStep managerStep() {
            return this.managerStepBuilderFactory.get("managerStep")
                       .chunk(100)
                       .reader(itemReader())
                       .outputChannel(requests()) // requests sent to workers
                       .inputChannel(replies())   // replies received from workers
                       .build();
        }

        // Middleware beans setup omitted

    }

    @Configuration
    public static class WorkerConfiguration {

        @Autowired
        private RemoteChunkingWorkerBuilder workerBuilder;

        @Bean
        public IntegrationFlow workerFlow() {
            return this.workerBuilder
                       .itemProcessor(itemProcessor())
                       .itemWriter(itemWriter())
                       .inputChannel(requests()) // requests received from the manager
                       .outputChannel(replies()) // replies sent to the manager
                       .build();
        }

        // Middleware beans setup omitted

    }

}




You can find a complete example of a remote chunking job
here.spring-doc.cadn.net.cn



Remote Partitioning




Figure 5. Remote Partitioning


Remote Partitioning, on the other hand, is useful when it
is not the processing of items but rather the associated I/O that
causes the bottleneck. Using Remote Partitioning, work can
be farmed out to workers that execute complete Spring Batch
steps. Thus, each worker has its own ItemReader, ItemProcessor, and
ItemWriter. For this purpose, Spring Batch
Integration provides the MessageChannelPartitionHandler.spring-doc.cadn.net.cn


This implementation of the PartitionHandler
interface uses MessageChannel instances to
send instructions to remote workers and receive their responses.
This provides a nice abstraction from the transports (such as JMS
and AMQP) being used to communicate with the remote workers.spring-doc.cadn.net.cn


The section of the "Scalability" chapter that addresses
remote partitioning provides an overview of the concepts and
components needed to configure remote partitioning and shows an
example of using the default
TaskExecutorPartitionHandler to partition
in separate local threads of execution. For remote partitioning
to multiple JVMs, two additional components are required:spring-doc.cadn.net.cn




A remoting fabric or grid environmentspring-doc.cadn.net.cn


A PartitionHandler implementation that supports the desired
remoting fabric or grid environmentspring-doc.cadn.net.cn




Similar to remote chunking, JMS can be used as the “remoting fabric”. In that case, use
a MessageChannelPartitionHandler instance as the PartitionHandler implementation,
as described earlier.spring-doc.cadn.net.cn


The following example assumes an existing partitioned job and focuses on the
MessageChannelPartitionHandler and JMS configuration in XML:spring-doc.cadn.net.cn


XML Configuration

<bean id="partitionHandler"
   class="org.springframework.batch.integration.partition.MessageChannelPartitionHandler">
  <property name="stepName" value="step1"/>
  <property name="gridSize" value="3"/>
  <property name="replyChannel" ref="outbound-replies"/>
  <property name="messagingOperations">
    <bean class="org.springframework.integration.core.MessagingTemplate">
      <property name="defaultChannel" ref="outbound-requests"/>
      <property name="receiveTimeout" value="100000"/>
    </bean>
  </property>
</bean>

<int:channel id="outbound-requests"/>
<int-jms:outbound-channel-adapter destination="requestsQueue"
    channel="outbound-requests"/>

<int:channel id="inbound-requests"/>
<int-jms:message-driven-channel-adapter destination="requestsQueue"
    channel="inbound-requests"/>

<bean id="stepExecutionRequestHandler"
    class="org.springframework.batch.integration.partition.StepExecutionRequestHandler">
  <property name="jobExplorer" ref="jobExplorer"/>
  <property name="stepLocator" ref="stepLocator"/>
</bean>

<int:service-activator ref="stepExecutionRequestHandler" input-channel="inbound-requests"
    output-channel="outbound-staging"/>

<int:channel id="outbound-staging"/>
<int-jms:outbound-channel-adapter destination="stagingQueue"
    channel="outbound-staging"/>

<int:channel id="inbound-staging"/>
<int-jms:message-driven-channel-adapter destination="stagingQueue"
    channel="inbound-staging"/>

<int:aggregator ref="partitionHandler" input-channel="inbound-staging"
    output-channel="outbound-replies"/>

<int:channel id="outbound-replies">
  <int:queue/>
</int:channel>

<bean id="stepLocator"
    class="org.springframework.batch.integration.partition.BeanFactoryStepLocator" />



The following example assumes an existing partitioned job and focuses on the
MessageChannelPartitionHandler and JMS configuration in Java:spring-doc.cadn.net.cn


Java Configuration

/*
 * Configuration of the manager side
 */
@Bean
public PartitionHandler partitionHandler() {
    MessageChannelPartitionHandler partitionHandler = new MessageChannelPartitionHandler();
    partitionHandler.setStepName("step1");
    partitionHandler.setGridSize(3);
    partitionHandler.setReplyChannel(outboundReplies());
    MessagingTemplate template = new MessagingTemplate();
    template.setDefaultChannel(outboundRequests());
    template.setReceiveTimeout(100000);
    partitionHandler.setMessagingOperations(template);
    return partitionHandler;
}

@Bean
public QueueChannel outboundReplies() {
    return new QueueChannel();
}

@Bean
public DirectChannel outboundRequests() {
    return new DirectChannel();
}

@Bean
public IntegrationFlow outboundJmsRequests() {
    return IntegrationFlows.from("outboundRequests")
            .handle(Jms.outboundGateway(connectionFactory())
                    .requestDestination("requestsQueue"))
            .get();
}

@Bean
@ServiceActivator(inputChannel = "inboundStaging")
public AggregatorFactoryBean partitioningMessageHandler() throws Exception {
    AggregatorFactoryBean aggregatorFactoryBean = new AggregatorFactoryBean();
    aggregatorFactoryBean.setProcessorBean(partitionHandler());
    aggregatorFactoryBean.setOutputChannel(outboundReplies());
    // configure other propeties of the aggregatorFactoryBean
    return aggregatorFactoryBean;
}

@Bean
public DirectChannel inboundStaging() {
    return new DirectChannel();
}

@Bean
public IntegrationFlow inboundJmsStaging() {
    return IntegrationFlows
            .from(Jms.messageDrivenChannelAdapter(connectionFactory())
                    .configureListenerContainer(c -> c.subscriptionDurable(false))
                    .destination("stagingQueue"))
            .channel(inboundStaging())
            .get();
}

/*
 * Configuration of the worker side
 */
@Bean
public StepExecutionRequestHandler stepExecutionRequestHandler() {
    StepExecutionRequestHandler stepExecutionRequestHandler = new StepExecutionRequestHandler();
    stepExecutionRequestHandler.setJobExplorer(jobExplorer);
    stepExecutionRequestHandler.setStepLocator(stepLocator());
    return stepExecutionRequestHandler;
}

@Bean
@ServiceActivator(inputChannel = "inboundRequests", outputChannel = "outboundStaging")
public StepExecutionRequestHandler serviceActivator() throws Exception {
    return stepExecutionRequestHandler();
}

@Bean
public DirectChannel inboundRequests() {
    return new DirectChannel();
}

public IntegrationFlow inboundJmsRequests() {
    return IntegrationFlows
            .from(Jms.messageDrivenChannelAdapter(connectionFactory())
                    .configureListenerContainer(c -> c.subscriptionDurable(false))
                    .destination("requestsQueue"))
            .channel(inboundRequests())
            .get();
}

@Bean
public DirectChannel outboundStaging() {
    return new DirectChannel();
}

@Bean
public IntegrationFlow outboundJmsStaging() {
    return IntegrationFlows.from("outboundStaging")
            .handle(Jms.outboundGateway(connectionFactory())
                    .requestDestination("stagingQueue"))
            .get();
}




You must also ensure that the partition handler attribute maps to the partitionHandler
bean.spring-doc.cadn.net.cn


The following example maps the partition handler attribute to the partitionHandler in
XML:spring-doc.cadn.net.cn


XML Configuration

<job id="personJob">
  <step id="step1.manager">
    <partition partitioner="partitioner" handler="partitionHandler"/>
    ...
  </step>
</job>



The following example maps the partition handler attribute to the partitionHandler in
Java:spring-doc.cadn.net.cn


Java Configuration

	public Job personJob() {
		return jobBuilderFactory.get("personJob")
				.start(stepBuilderFactory.get("step1.manager")
						.partitioner("step1.worker", partitioner())
						.partitionHandler(partitionHandler())
						.build())
				.build();
	}




You can find a complete example of a remote partitioning job
here.spring-doc.cadn.net.cn


The @EnableBatchIntegration annotation that can be used to simplify a remote
 partitioning setup. This annotation provides two beans useful for remote partitioning:spring-doc.cadn.net.cn




RemotePartitioningManagerStepBuilderFactory: used to configure the manager stepspring-doc.cadn.net.cn


RemotePartitioningWorkerStepBuilderFactory: used to configure the worker stepspring-doc.cadn.net.cn




These APIs take care of configuring a number of components as described in the following diagram:spring-doc.cadn.net.cn





Figure 6. Remote Partitioning Configuration (with job repository polling)





Figure 7. Remote Partitioning Configuration (with replies aggregation)


On the manager side, the RemotePartitioningManagerStepBuilderFactory allows you to
configure a manager step by declaring:spring-doc.cadn.net.cn




the Partitioner used to partition dataspring-doc.cadn.net.cn


the output channel ("Outgoing requests") to send requests to workersspring-doc.cadn.net.cn


the input channel ("Incoming replies") to receive replies from workers (when configuring replies aggregation)spring-doc.cadn.net.cn


the poll interval and timeout parameters (when configuring job repository polling)spring-doc.cadn.net.cn




The MessageChannelPartitionHandler and the MessagingTemplate are not needed to be explicitly configured
(Those can still be explicitly configured if required).spring-doc.cadn.net.cn


On the worker side, the RemotePartitioningWorkerStepBuilderFactory allows you to configure a worker to:spring-doc.cadn.net.cn




listen to requests sent by the manager on the input channel ("Incoming requests")spring-doc.cadn.net.cn


call the handle method of StepExecutionRequestHandler for each requestspring-doc.cadn.net.cn


send replies on the output channel ("Outgoing replies") to the managerspring-doc.cadn.net.cn




There is no need to explicitly configure the StepExecutionRequestHandler (which can be explicitly configured if required).spring-doc.cadn.net.cn


The following example shows how to use these APIs:spring-doc.cadn.net.cn



@Configuration
@EnableBatchProcessing
@EnableBatchIntegration
public class RemotePartitioningJobConfiguration {

    @Configuration
    public static class ManagerConfiguration {

        @Autowired
        private RemotePartitioningManagerStepBuilderFactory managerStepBuilderFactory;

        @Bean
        public Step managerStep() {
                 return this.managerStepBuilderFactory
                    .get("managerStep")
                    .partitioner("workerStep", partitioner())
                    .gridSize(10)
                    .outputChannel(outgoingRequestsToWorkers())
                    .inputChannel(incomingRepliesFromWorkers())
                    .build();
        }

        // Middleware beans setup omitted

    }

    @Configuration
    public static class WorkerConfiguration {

        @Autowired
        private RemotePartitioningWorkerStepBuilderFactory workerStepBuilderFactory;

        @Bean
        public Step workerStep() {
                 return this.workerStepBuilderFactory
                    .get("workerStep")
                    .inputChannel(incomingRequestsFromManager())
                    .outputChannel(outgoingRepliesToManager())
                    .chunk(100)
                    .reader(itemReader())
                    .processor(itemProcessor())
                    .writer(itemWriter())
                    .build();
        }

        // Middleware beans setup omitted

    }

}