Question 1

What is a FIX engine and what does it do?

Accepted Answer

A FIX engine is the software component responsible for managing the FIX session protocol: establishing and maintaining TCP/IP connections to counterparties, generating and tracking sequence numbers for every outbound message, sending heartbeat messages (35=0) at configured intervals to detect connectivity failures, and handling message recovery via ResendRequest (35=2) when sequence gaps are detected. The FIX engine sits between a firm's trading application — OMS, EMS, or risk system — and the counterparty's acceptor. It handles all session-level mechanics so that the trading application only needs to construct and consume application-layer FIX messages (orders, ExecutionReports, allocations) without managing connection state directly. A FIX engine failure is an operational failure: no orders can be sent and no fills can be received until the session is restored.

Question 2

How does FIX session management work — logon, heartbeat, and logout?

Accepted Answer

A FIX session begins with a Logon message (35=A). The initiating party (typically the buy-side or broker-dealer) sends a Logon to the acceptor (typically the broker or venue) with a proposed heartbeat interval (HeartBtInt, tag 108) and the current sequence number. The acceptor responds with its own Logon acknowledging the session. During the session, both sides send Heartbeat messages (35=0) at the agreed interval — typically every 30 seconds — to confirm the connection is alive. If no message is received within the heartbeat interval, the waiting party sends a TestRequest (35=1); if no response arrives, the session is considered broken and is disconnected. Session termination is initiated by a Logout message (35=5). Abnormal disconnections — network failure, engine crash — require session recovery on reconnect, including ResendRequest to replay any messages sent during the outage.

Question 3

What are FIX sequence numbers and why do gaps matter?

Accepted Answer

Every FIX message carries a MsgSeqNum (tag 34) that increments by one for each message sent in a given direction on a session. The receiver tracks the expected next sequence number; when a message arrives with a sequence number higher than expected, a gap has occurred — one or more messages were not received. The receiving engine sends a ResendRequest (35=2) asking the sender to retransmit the missing messages. During the replay window, the receiver may continue processing newer messages with a flag indicating they are out-of-order, or may hold all processing until the gap is filled — behaviour varies by engine and configuration. Operationally, a sequence gap during execution means that fills recorded at the broker may not yet be reflected in the OMS or risk system. Gaps during high-volume periods can compound: the ResendRequest causes the counterparty to retransmit a large block of messages, potentially causing processing latency that widens the visibility window further.

Question 4

What is a drop copy session in FIX and why is it required?

Accepted Answer

A drop copy session is a secondary FIX session that receives real-time copies of all ExecutionReports — fills, partial fills, order acknowledgements — in parallel with the primary order management session. Drop copies are not used to send orders; they are passive recipients. The primary operational use of drop copies is to feed the risk management system and the compliance system with real-time position and execution data independent of the OMS. If the primary OMS session is slow to process fills, or if the OMS is temporarily unavailable, the risk system continues to receive ExecutionReports via the drop copy session. Regulatory requirements — including pre-trade risk checks and real-time exposure limits — depend on the risk system having timely and complete fill data. A drop copy session that is lagging or disconnected creates a risk blind spot: the risk system's position calculations are stale, and pre-trade checks may allow orders that breach limits.

Question 5

What is FIX engine high-availability architecture?

Accepted Answer

A high-availability FIX engine deployment ensures that a single point of failure — server hardware, network link, or process crash — does not terminate all active FIX sessions. The standard approach is an active-passive pair: two engine instances share session state, with the passive instance taking over if the active instance fails. Session state includes the current sequence numbers in both directions and the persisted message store, which allows the passive engine to resume the session without a full logout/logon cycle. Some engines support active-active configurations where sessions are distributed across multiple nodes. Network redundancy is a separate concern: a firm may maintain dual physical uplinks to a broker's network with automatic failover, so that a link failure does not cause a FIX session disconnection. Engine HA and network HA must be architected together; a highly available engine on a single network path still fails when the path goes down.

Question 6

What does FIX certification with a broker or venue involve?

Accepted Answer

FIX certification is the bilateral testing process that validates both sides of a FIX connection can exchange messages correctly before going live. Each broker and venue publishes a FIX specification — a document defining which message types are supported, which tags are mandatory or optional, what values are accepted for enumerated tags, and what custom tags (in the 5000–9999 range) are in use. Certification involves: configuring the engine with the counterparty's specification, connecting to the counterparty's certification environment, and executing a test script that covers all message flows the production session will use — order entry, cancellation, modification, execution reporting, and post-trade messages where applicable. Each counterparty runs its own certification programme. Large broker-dealers may take two to four weeks per counterparty; this is not a parallel activity — each certification requires dedicated connectivity environment access. Firms onboarding multiple brokers simultaneously must plan certification as a sequenced project. Going live on a session that has not been certified is a material operational risk: production message rejections may not surface until orders are live in the market.

Question 7

Commercial vs open-source FIX engines — how do they differ operationally?

Accepted Answer

Commercial FIX engines — such as those from Ullink (now Broadridge), OnixS, or B2BITS — offer built-in HA, monitoring tools, broad version support, and vendor support contracts. They handle edge cases in the FIX specification that open-source engines may not cover, and are pre-tested against a broader range of counterparty configurations. Open-source engines — most notably QuickFIX and QuickFIX/n — provide the core session protocol management and are widely deployed by mid-market firms and fintechs. They require internal development effort to build HA, monitoring, and counterparty-specific customisation. OMS-embedded FIX engines ship with the OMS and handle connectivity for that OMS's supported message types; they may not support the full range of FIX message versions or custom broker tags. The choice depends on throughput requirements, the number of counterparties, internal development capacity, and the firm's tolerance for building vs buying reliability infrastructure.

Question 8

What are the most operationally significant FIX engine failure modes?

Accepted Answer

Sequence gap leading to session logout: when gap recovery fails — because the sender's message store does not have the missing messages, or because the gap is too large for the acceptor's retry policy — the session is terminated. All fills during the outage must be reconciled manually; execution replay after ResendRequest also produces duplicate messages, so downstream systems must be idempotent on ExecID (tag 17). Resend storm: an aggressive gap-fill configuration causes the engine to retransmit a large number of messages simultaneously, generating processing backlog and latency spikes that delay new fill processing. Drop copy lag or disconnection: the risk system operates without current fill data, creating position blind spots. Session configuration mismatch on reconnect: if sequence numbers are reset on one side but not the other after a restart — the most common cause of Monday morning logon failures — the logon is rejected until both sides agree on the starting sequence out of band. Clock drift: NTP synchronisation failures cause SendingTime (tag 52) to drift; most counterparties enforce a tolerance window of ±120 seconds. BusinessMessageReject (35=j): indicates application-level refusal — the session is intact but the message content was invalid; distinct from a session-layer Reject (35=3) and requiring a different resolution path.

Deployment Profile	Throughput	Location	HA Option	Typical User
Commercial (licensed)	Very high	On-premise or cloud	Built-in active-passive	Tier-1 broker-dealer
Open source (QuickFIX)	Moderate	Self-hosted	Custom build required	Mid-market, fintech
OMS-embedded	Application-limited	Bundled with OMS	OMS HA model	Buy-side OMS user
Co-located / FPGA	Ultra-low latency	Exchange proximity	Redundant sessions	HFT, algo trading

FIX Engine Connectivity for Broker-Dealers

FIX Engine Connectivity for Broker-Dealers

Definition

FIX engine — deployment profiles

How it works

In Devancore™

Related terms

Contact