Skip to main content

GetSFTP

Description

Fetches files from an SFTP Server and creates FlowFiles from them

Tags

fetch, files, get, ingest, input, remote, retrieve, sftp, source

Properties

In the list below required Properties are shown with an asterisk (*). Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

Display NameAPI NameDefault ValueAllowable ValuesDescription
Hostname *HostnameThe fully qualified hostname or IP address of the remote system

Supports Expression Language, using FlowFile attributes and Environment variables.
Port *Port22The port that the remote system is listening on for file transfers

Supports Expression Language, using FlowFile attributes and Environment variables.
Username *UsernameUsername

Supports Expression Language, using FlowFile attributes and Environment variables.
PasswordPasswordPassword for the user account

Supports Expression Language, using FlowFile attributes and Environment variables.
Private Key PathPrivate Key PathThe fully qualified path to the Private Key file

Supports Expression Language, using FlowFile attributes and Environment variables.
Private Key PassphrasePrivate Key PassphrasePassword for the private key

Supports Expression Language, using FlowFile attributes and Environment variables.
Remote PathRemote PathThe path on the remote system from which to pull or push files

Supports Expression Language, using FlowFile attributes and Environment variables.
File Filter RegexFile Filter RegexProvides a Java Regular Expression for filtering Filenames; if a filter is supplied, only files whose names match that Regular Expression will be fetched
Path Filter RegexPath Filter RegexWhen Search Recursively is true, then only subdirectories whose path matches the given Regular Expression will be scanned
Polling Interval *Polling Interval60 secDetermines how long to wait between fetching the listing for new files
Search Recursively *Search Recursivelyfalse
  • true
  • false
If true, will pull files from arbitrarily nested subdirectories; otherwise, will not traverse subdirectories
Follow symlink *follow-symlinkfalse
  • true
  • false
If true, will pull even symbolic files and also nested symbolic subdirectories; otherwise, will not read symbolic files and will not traverse symbolic link subdirectories
Ignore Dotted Files *Ignore Dotted Filestrue
  • true
  • false
If true, files whose names begin with a dot (".") will be ignored
Delete Original *Delete Originaltrue
  • true
  • false
Determines whether or not the file is deleted from the remote system after it has been successfully transferred
Connection Timeout *Connection Timeout30 secAmount of time to wait before timing out while creating a connection
Data Timeout *Data Timeout30 secWhen transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems
Host Key FileHost Key FileIf supplied, the given file will be used as the Host Key; otherwise, if 'Strict Host Key Checking' property is applied (set to true) then uses the 'known_hosts' and 'known_hosts2' files from ~/.ssh directory else no host key file will be used
Max Selects *Max Selects100The maximum number of files to pull in a single connection
Remote Poll Batch Size *Remote Poll Batch Size5000The value specifies how many file paths to find in a given directory on the remote system when doing a file listing. This value in general should not need to be modified but when polling against a remote system with a tremendous number of files this value can be critical. Setting this value too high can result very poor performance and setting it too low can cause the flow to be slower than normal.
Strict Host Key Checking *Strict Host Key Checkingfalse
  • true
  • false
Indicates whether or not strict enforcement of hosts keys should be applied
Send Keep Alive On Timeout *Send Keep Alive On Timeouttrue
  • true
  • false
Send a Keep Alive message every 5 seconds up to 5 times for an overall timeout of 25 seconds.
Use Compression *Use Compressionfalse
  • true
  • false
Indicates whether or not ZLIB compression should be used when transferring files
Use Natural Ordering *Use Natural Orderingfalse
  • true
  • false
If true, will pull files in the order in which they are naturally listed; otherwise, the order in which the files will be pulled is not defined
Proxy Configuration Serviceproxy-configuration-serviceController Service:
ProxyConfigurationService

Implementations:
StandardProxyConfigurationService
Specifies the Proxy Configuration Controller Service to proxy network requests. Supported proxies: HTTP + AuthN, SOCKS + AuthN
Ciphers AllowedCiphers AllowedA comma-separated list of Ciphers allowed for SFTP connections. Leave unset to allow all. Available options are: 3des-cbc, 3des-ctr, aes128-cbc, aes128-ctr, aes128-gcm@openssh.com, aes192-cbc, aes192-ctr, aes256-cbc, aes256-ctr, aes256-gcm@openssh.com, arcfour, arcfour128, arcfour256, blowfish-cbc, blowfish-ctr, cast128-cbc, cast128-ctr, chacha20-poly1305@openssh.com, idea-cbc, idea-ctr, serpent128-cbc, serpent128-ctr, serpent192-cbc, serpent192-ctr, serpent256-cbc, serpent256-ctr, twofish-cbc, twofish128-cbc, twofish128-ctr, twofish192-cbc, twofish192-ctr, twofish256-cbc, twofish256-ctr

Supports Expression Language, using Environment variables.
Key Algorithms AllowedKey Algorithms AllowedA comma-separated list of Key Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: ecdsa-sha2-nistp256, ecdsa-sha2-nistp256-cert-v01@openssh.com, ecdsa-sha2-nistp384, ecdsa-sha2-nistp384-cert-v01@openssh.com, ecdsa-sha2-nistp521, ecdsa-sha2-nistp521-cert-v01@openssh.com, rsa-sha2-256, rsa-sha2-512, ssh-dss, ssh-dss-cert-v01@openssh.com, ssh-ed25519, ssh-ed25519-cert-v01@openssh.com, ssh-rsa, ssh-rsa-cert-v01@openssh.com

Supports Expression Language, using Environment variables.
Key Exchange Algorithms AllowedKey Exchange Algorithms AllowedA comma-separated list of Key Exchange Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: curve25519-sha256, curve25519-sha256@libssh.org, diffie-hellman-group-exchange-sha1, diffie-hellman-group-exchange-sha256, diffie-hellman-group1-sha1, diffie-hellman-group14-sha1, diffie-hellman-group14-sha256, diffie-hellman-group14-sha256@ssh.com, diffie-hellman-group15-sha256, diffie-hellman-group15-sha256@ssh.com, diffie-hellman-group15-sha384@ssh.com, diffie-hellman-group15-sha512, diffie-hellman-group16-sha256, diffie-hellman-group16-sha384@ssh.com, diffie-hellman-group16-sha512, diffie-hellman-group16-sha512@ssh.com, diffie-hellman-group17-sha512, diffie-hellman-group18-sha512, diffie-hellman-group18-sha512@ssh.com, ecdh-sha2-nistp256, ecdh-sha2-nistp384, ecdh-sha2-nistp521, ext-info-c

Supports Expression Language, using Environment variables.
Message Authentication Codes AllowedMessage Authentication Codes AllowedA comma-separated list of Message Authentication Codes allowed for SFTP connections. Leave unset to allow all. Available options are: hmac-md5, hmac-md5-96, hmac-md5-96-etm@openssh.com, hmac-md5-etm@openssh.com, hmac-ripemd160, hmac-ripemd160-96, hmac-ripemd160-etm@openssh.com, hmac-ripemd160@openssh.com, hmac-sha1, hmac-sha1-96, hmac-sha1-96@openssh.com, hmac-sha1-etm@openssh.com, hmac-sha2-256, hmac-sha2-256-etm@openssh.com, hmac-sha2-512, hmac-sha2-512-etm@openssh.com

Supports Expression Language, using Environment variables.

Dynamic Properties

This component does not support dynamic properties.

Relationships

NameDescription
successAll FlowFiles that are received are routed to success

Reads Attributes

This processor does not read attributes.

Writes Attributes

NameDescription
absolute.pathThe full/absolute path from where a file was picked up. The current 'path' attribute is still populated, but may be a relative path
file.groupThe numeric group id of the source file
file.lastModifiedTimeThe date and time that the source file was last modified
file.ownerThe numeric owner id of the source file
file.permissionsThe read/write/execute permissions of the source file
filenameThe filename is set to the name of the file on the remote server
pathThe path is set to the path of the file's directory on the remote server. For example, if the <Remote Path> property is set to /tmp, files picked up from /tmp will have the path attribute set to /tmp. If the <Search Recursively> property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to /tmp/abc/1/2/3

State Management

This component does not store state.

Restricted

This component is not restricted.

Input Requirement

This component does not allow an incoming relationship.

System Resource Considerations

This component does not specify system resource considerations.

See Also

PutSFTP