1# $OpenLDAP$ 2# Copyright 1999-2020 The OpenLDAP Foundation, All Rights Reserved. 3# COPYING RESTRICTIONS APPLY, see COPYRIGHT. 4 5H1: Replication 6 7Replicated directories are a fundamental requirement for delivering a 8resilient enterprise deployment. 9 10{{PRD:OpenLDAP}} has various configuration options for creating a replicated 11directory. In previous releases, replication was discussed in terms of 12a {{master}} server and some number of {{slave}} servers. A master 13accepted directory updates from other clients, and a slave only 14accepted updates from a (single) master. The replication structure 15was rigidly defined and any particular database could only fulfill 16a single role, either master or slave. 17 18As OpenLDAP now supports a wide variety of replication topologies, these 19terms have been deprecated in favor of {{provider}} and 20{{consumer}}: A provider replicates directory updates to consumers; 21consumers receive replication updates from providers. Unlike the 22rigidly defined master/slave relationships, provider/consumer roles 23are quite fluid: replication updates received in a consumer can be 24further propagated by that consumer to other servers, so a consumer 25can also act simultaneously as a provider. Also, a consumer need not 26be an actual LDAP server; it may be just an LDAP client. 27 28The following sections will describe the replication technology and 29discuss the various replication options that are available. 30 31H2: Replication Technology 32 33H3: LDAP Sync Replication 34 35The {{TERM:LDAP Sync}} Replication engine, {{TERM:syncrepl}} for 36short, is a consumer-side replication engine that enables the 37consumer {{TERM:LDAP}} server to maintain a shadow copy of a 38{{TERM:DIT}} fragment. A syncrepl engine resides at the consumer 39and executes as one of the {{slapd}}(8) threads. It creates and maintains a 40consumer replica by connecting to the replication provider to perform 41the initial DIT content load followed either by periodic content 42polling or by timely updates upon content changes. 43 44Syncrepl uses the LDAP Content Synchronization protocol (or LDAP Sync for 45short) as the replica synchronization protocol. LDAP Sync provides 46a stateful replication which supports both pull-based and push-based 47synchronization and does not mandate the use of a history store. 48In pull-based replication the consumer periodically 49polls the provider for updates. In push-based replication the consumer 50listens for updates that are sent by the provider in realtime. Since the 51protocol does not require a history store, the provider does not need to 52maintain any log of updates it has received (Note 53that the syncrepl engine is extensible and additional replication 54protocols may be supported in the future.). 55 56Syncrepl keeps track of the status of the replication content by 57maintaining and exchanging synchronization cookies. Because the 58syncrepl consumer and provider maintain their content status, the 59consumer can poll the provider content to perform incremental 60synchronization by asking for the entries required to make the 61consumer replica up-to-date with the provider content. Syncrepl 62also enables convenient management of replicas by maintaining replica 63status. The consumer replica can be constructed from a consumer-side 64or a provider-side backup at any synchronization status. Syncrepl 65can automatically resynchronize the consumer replica up-to-date 66with the current provider content. 67 68Syncrepl supports both pull-based and push-based synchronization. 69In its basic refreshOnly synchronization mode, the provider uses 70pull-based synchronization where the consumer servers need not be 71tracked and no history information is maintained. The information 72required for the provider to process periodic polling requests is 73contained in the synchronization cookie of the request itself. To 74optimize the pull-based synchronization, syncrepl utilizes the 75present phase of the LDAP Sync protocol as well as its delete phase, 76instead of falling back on frequent full reloads. To further optimize 77the pull-based synchronization, the provider can maintain a per-scope 78session log as a history store. In its refreshAndPersist mode of 79synchronization, the provider uses a push-based synchronization. 80The provider keeps track of the consumer servers that have requested 81a persistent search and sends them necessary updates as the provider 82replication content gets modified. 83 84With syncrepl, a consumer server can create a replica without 85changing the provider's configurations and without restarting the 86provider server, if the consumer server has appropriate access 87privileges for the DIT fragment to be replicated. The consumer 88server can stop the replication also without the need for provider-side 89changes and restart. 90 91Syncrepl supports partial, sparse, and fractional replications. The shadow 92DIT fragment is defined by a general search criteria consisting of 93base, scope, filter, and attribute list. The replica content is 94also subject to the access privileges of the bind identity of the 95syncrepl replication connection. 96 97 98H4: The LDAP Content Synchronization Protocol 99 100The LDAP Sync protocol allows a client to maintain a synchronized 101copy of a DIT fragment. The LDAP Sync operation is defined as a set 102of controls and other protocol elements which extend the LDAP search 103operation. This section introduces the LDAP Content Sync protocol 104only briefly. For more information, refer to {{REF:RFC4533}}. 105 106The LDAP Sync protocol supports both polling and listening for changes 107by defining two respective synchronization operations: 108{{refreshOnly}} and {{refreshAndPersist}}. Polling is implemented 109by the {{refreshOnly}} operation. The consumer 110polls the provider using an LDAP Search request with an LDAP Sync 111control attached. The consumer copy is synchronized 112to the provider copy at the time of polling using the information 113returned in the search. The provider finishes the 114search operation by returning {{SearchResultDone}} at the end of 115the search operation as in the normal search. Listening is 116implemented by the {{refreshAndPersist}} operation. As the name 117implies, it begins with a search, like refreshOnly. Instead of 118finishing the search after returning all entries currently matching 119the search criteria, the synchronization search remains persistent 120in the provider. Subsequent updates to the synchronization content 121in the provider cause additional entry updates to be sent to the 122consumer. 123 124The {{refreshOnly}} operation and the refresh stage of the 125{{refreshAndPersist}} operation can be performed with a present 126phase or a delete phase. 127 128In the present phase, the provider sends the consumer the entries updated 129within the search scope since the last synchronization. The provider 130sends all requested attributes, be they changed or not, of the updated 131entries. For each unchanged entry which remains in the scope, the 132provider sends a present message consisting only of the name of the 133entry and the synchronization control representing state present. 134The present message does not contain any attributes of the entry. 135After the consumer receives all update and present entries, it can 136reliably determine the new consumer copy by adding the entries added 137to the provider, by replacing the entries modified at the provider, and 138by deleting entries in the consumer copy which have not been updated 139nor specified as being present at the provider. 140 141The transmission of the updated entries in the delete phase is the 142same as in the present phase. The provider sends all the requested 143attributes of the entries updated within the search scope since the 144last synchronization to the consumer. In the delete phase, however, 145the provider sends a delete message for each entry deleted from the 146search scope, instead of sending present messages. The delete 147message consists only of the name of the entry and the synchronization 148control representing state delete. The new consumer copy can be 149determined by adding, modifying, and removing entries according to 150the synchronization control attached to the {{SearchResultEntry}} 151message. 152 153In the case that the LDAP Sync provider maintains a history store and 154can determine which entries are scoped out of the consumer copy since 155the last synchronization time, the provider can use the delete phase. 156If the provider does not maintain any history store, cannot determine 157the scoped-out entries from the history store, or the history store 158does not cover the outdated synchronization state of the consumer, 159the provider should use the present phase. The use of the present 160phase is much more efficient than a full content reload in terms 161of the synchronization traffic. To reduce the synchronization 162traffic further, the LDAP Sync protocol also provides several 163optimizations such as the transmission of the normalized {{EX:entryUUID}}s 164and the transmission of multiple {{EX:entryUUIDs}} in a single 165{{syncIdSet}} message. 166 167At the end of the {{refreshOnly}} synchronization, the provider sends 168a synchronization cookie to the consumer as a state indicator of the 169consumer copy after the synchronization is completed. The consumer 170will present the received cookie when it requests the next incremental 171synchronization to the provider. 172 173When {{refreshAndPersist}} synchronization is used, the provider sends 174a synchronization cookie at the end of the refresh stage by sending 175a Sync Info message with refreshDone=TRUE. It also sends a 176synchronization cookie by attaching it to {{SearchResultEntry}} 177messages generated in the persist stage of the synchronization search. During 178the persist stage, the provider can also send a Sync Info message 179containing the synchronization cookie at any time the provider wants 180to update the consumer-side state indicator. 181 182In the LDAP Sync protocol, entries are uniquely identified by the 183{{EX:entryUUID}} attribute value. It can function as a reliable 184identifier of the entry. The DN of the entry, on the other hand, 185can be changed over time and hence cannot be considered as the 186reliable identifier. The {{EX:entryUUID}} is attached to each 187{{SearchResultEntry}} or {{SearchResultReference}} as a part of the 188synchronization control. 189 190H4: Syncrepl Details 191 192The syncrepl engine utilizes both the {{refreshOnly}} and the 193{{refreshAndPersist}} operations of the LDAP Sync protocol. If a 194syncrepl specification is included in a database definition, 195{{slapd}}(8) launches a syncrepl engine as a {{slapd}}(8) thread 196and schedules its execution. If the {{refreshOnly}} operation is 197specified, the syncrepl engine will be rescheduled at the interval 198time after a synchronization operation is completed. If the 199{{refreshAndPersist}} operation is specified, the engine will remain 200active and process the persistent synchronization messages from the 201provider. 202 203The syncrepl engine utilizes both the present phase and the delete 204phase of the refresh synchronization. It is possible to configure 205a session log in the provider which stores the 206{{EX:entryUUID}}s of a finite number of entries deleted from a 207database. Multiple replicas share the same session log. The syncrepl 208engine uses the 209delete phase if the session log is present and the state of the 210consumer server is recent enough that no session log entries are 211truncated after the last synchronization of the client. The syncrepl 212engine uses the present phase if no session log is configured for 213the replication content or if the consumer replica is too outdated 214to be covered by the session log. The current design of the session 215log store is memory based, so the information contained in the 216session log is not persistent over multiple provider invocations. 217It is not currently supported to access the session log store by 218using LDAP operations. It is also not currently supported to impose 219access control to the session log. 220 221As a further optimization, even in the case the synchronization 222search is not associated with any session log, no entries will be 223transmitted to the consumer server when there has been no update 224in the replication context. 225 226The syncrepl engine, which is a consumer-side replication engine, 227can work with any backends. The LDAP Sync provider can be configured 228as an overlay on any backend, but works best with the {{back-bdb}}, 229{{back-hdb}}, or {{back-mdb}} backends. 230 231The LDAP Sync provider maintains a {{EX:contextCSN}} for each 232database as the current synchronization state indicator of the 233provider content. It is the largest {{EX:entryCSN}} in the provider 234context such that no transactions for an entry having smaller 235{{EX:entryCSN}} value remains outstanding. The {{EX:contextCSN}} 236could not just be set to the largest issued {{EX:entryCSN}} because 237{{EX:entryCSN}} is obtained before a transaction starts and 238transactions are not committed in the issue order. 239 240The provider stores the {{EX:contextCSN}} of a context in the 241{{EX:contextCSN}} attribute of the context suffix entry. The attribute 242is not written to the database after every update operation though; 243instead it is maintained primarily in memory. At database start 244time the provider reads the last saved {{EX:contextCSN}} into memory 245and uses the in-memory copy exclusively thereafter. By default, 246changes to the {{EX:contextCSN}} as a result of database updates 247will not be written to the database until the server is cleanly 248shut down. A checkpoint facility exists to cause the {{EX:contextCSN}} to 249be written out more frequently if desired. 250 251Note that at startup time, if the provider is unable to read a 252{{EX:contextCSN}} from the suffix entry, it will scan the entire 253database to determine the value, and this scan may take quite a 254long time on a large database. When a {{EX:contextCSN}} value is 255read, the database will still be scanned for any {{EX:entryCSN}} 256values greater than it, to make sure the {{EX:contextCSN}} value 257truly reflects the greatest committed {{EX:entryCSN}} in the database. 258On databases which support inequality indexing, setting an eq index 259on the {{EX:entryCSN}} attribute and configuring {{contextCSN}} 260checkpoints will greatly speed up this scanning step. 261 262If no {{EX:contextCSN}} can be determined by reading and scanning 263the database, a new value will be generated. Also, if scanning the 264database yielded a greater {{EX:entryCSN}} than was previously 265recorded in the suffix entry's {{EX:contextCSN}} attribute, a 266checkpoint will be immediately written with the new value. 267 268The consumer also stores its replica state, which is the provider's 269{{EX:contextCSN}} received as a synchronization cookie, in the 270{{EX:contextCSN}} attribute of the suffix entry. The replica state 271maintained by a consumer server is used as the synchronization state 272indicator when it performs subsequent incremental synchronization 273with the provider server. It is also used as a provider-side 274synchronization state indicator when it functions as a secondary 275provider server in a cascading replication configuration. Since 276the consumer and provider state information are maintained in the 277same location within their respective databases, any consumer can 278be promoted to a provider (and vice versa) without any special 279actions. 280 281Because a general search filter can be used in the syncrepl 282specification, some entries in the context may be omitted from the 283synchronization content. The syncrepl engine creates a glue entry 284to fill in the holes in the replica context if any part of the 285replica content is subordinate to the holes. The glue entries will 286not be returned in the search result unless {{ManageDsaIT}} control 287is provided. 288 289Also as a consequence of the search filter used in the syncrepl 290specification, it is possible for a modification to remove an entry 291from the replication scope even though the entry has not been deleted 292on the provider. Logically the entry must be deleted on the consumer 293but in {{refreshOnly}} mode the provider cannot detect and propagate 294this change without the use of the session log on the provider. 295 296For configuration, please see the {{SECT:Syncrepl}} section. 297 298 299H2: Deployment Alternatives 300 301While the LDAP Sync specification only defines a narrow scope for replication, 302the OpenLDAP implementation is extremely flexible and supports a variety of 303operating modes to handle other scenarios not explicitly addressed in the spec. 304 305 306H3: Delta-syncrepl replication 307 308* Disadvantages of LDAP Sync replication: 309 310LDAP Sync replication is an object-based replication mechanism. 311When any attribute value in a replicated object is changed on the provider, 312each consumer fetches and processes the complete changed object, including 313{{B:both the changed and unchanged attribute values}} during replication. 314One advantage of this approach is that when multiple changes occur to 315a single object, the precise sequence of those changes need not be preserved; 316only the final state of the entry is significant. But this approach 317may have drawbacks when the usage pattern involves single changes to 318multiple objects. 319 320For example, suppose you have a database consisting of 102,400 objects of 1 KB 321each. Further, suppose you routinely run a batch job to change the value of 322a single two-byte attribute value that appears in each of the 102,400 objects 323on the master. Not counting LDAP and TCP/IP protocol overhead, each time you 324run this job each consumer will transfer and process {{B:100 MB}} of data to 325process {{B:200KB of changes!}} 326 32799.98% of the data that is transmitted and processed in a case like this will 328be redundant, since it represents values that did not change. This is a waste 329of valuable transmission and processing bandwidth and can cause an unacceptable 330replication backlog to develop. While this situation is extreme, it serves to 331demonstrate a very real problem that is encountered in some LDAP deployments. 332 333 334* Where Delta-syncrepl comes in: 335 336Delta-syncrepl, a changelog-based variant of syncrepl, is designed to address 337situations like the one described above. Delta-syncrepl works by maintaining a 338changelog of a selectable depth in a separate database on the provider. The replication consumer 339checks the changelog for the changes it needs and, as long as 340the changelog contains the needed changes, the consumer fetches the changes 341from the changelog and applies them to its database. If, however, a replica 342is too far out of sync (or completely empty), conventional syncrepl is used to 343bring it up to date and replication then switches back to the delta-syncrepl 344mode. 345 346Note: since the database state is stored in both the changelog DB and the 347main DB on the provider, it is important to backup/restore both the changelog 348DB and the main DB using slapcat/slapadd when restoring a DB or copying 349it to another machine. 350 351For configuration, please see the {{SECT:Delta-syncrepl}} section. 352 353 354H3: N-Way Multi-Master replication 355 356Multi-Master replication is a replication technique using Syncrepl to replicate 357data to multiple provider ("Master") Directory servers. 358 359H4: Valid Arguments for Multi-Master replication 360 361* If any provider fails, other providers will continue to accept updates 362* Avoids a single point of failure 363* Providers can be located in several physical sites i.e. distributed across 364the network/globe. 365* Good for Automatic failover/High Availability 366 367H4: Invalid Arguments for Multi-Master replication 368 369(These are often claimed to be advantages of Multi-Master replication but 370those claims are false): 371 372* It has {{B:NOTHING}} to do with load balancing 373* Providers {{B:must}} propagate writes to {{B:all}} the other servers, which 374means the network traffic and write load spreads across all 375of the servers the same as for single-master. 376* Server utilization and performance are at best identical for 377Multi-Master and Single-Master replication; at worst Single-Master is 378superior because indexing can be tuned differently to optimize for the 379different usage patterns between the provider and the consumers. 380 381H4: Arguments against Multi-Master replication 382 383* Breaks the data consistency guarantees of the directory model 384* {{URL:http://www.openldap.org/faq/data/cache/1240.html}} 385* If connectivity with a provider is lost because of a network partition, then 386"automatic failover" can just compound the problem 387* Typically, a particular machine cannot distinguish between losing contact 388 with a peer because that peer crashed, or because the network link has failed 389* If a network is partitioned and multiple clients start writing to each of the 390"masters" then reconciliation will be a pain; it may be best to simply deny 391writes to the clients that are partitioned from the single provider 392 393 394For configuration, please see the {{SECT:N-Way Multi-Master}} section below 395 396H3: MirrorMode replication 397 398MirrorMode is a hybrid configuration that provides all of the consistency 399guarantees of single-master replication, while also providing the high 400availability of multi-master. In MirrorMode two providers are set up to 401replicate from each other (as a multi-master configuration), but an 402external frontend is employed to direct all writes to only one of 403the two servers. The second provider will only be used for writes if 404the first provider crashes, at which point the frontend will switch to 405directing all writes to the second provider. When a crashed provider is 406repaired and restarted it will automatically catch up to any changes 407on the running provider and resync. 408 409H4: Arguments for MirrorMode 410 411* Provides a high-availability (HA) solution for directory writes (replicas handle reads) 412* As long as one provider is operational, writes can safely be accepted 413* Provider nodes replicate from each other, so they are always up to date and 414can be ready to take over (hot standby) 415* Syncrepl also allows the provider nodes to re-synchronize after any downtime 416 417 418H4: Arguments against MirrorMode 419 420* MirrorMode is not what is termed as a Multi-Master solution. This is because 421writes have to go to just one of the mirror nodes at a time 422* MirrorMode can be termed as Active-Active Hot-Standby, therefore an external 423server (slapd in proxy mode) or device (hardware load balancer) 424is needed to manage which provider is currently active 425* Backups are managed slightly differently 426- If backing up the Berkeley database itself and periodically backing up the 427transaction log files, then the same member of the mirror pair needs to be 428used to collect logfiles until the next database backup is taken 429 430For configuration, please see the {{SECT:MirrorMode}} section below 431 432 433H3: Syncrepl Proxy Mode 434 435While the LDAP Sync protocol supports both pull- and push-based replication, 436the push mode (refreshAndPersist) must still be initiated from the consumer 437before the provider can begin pushing changes. In some network configurations, 438particularly where firewalls restrict the direction in which connections 439can be made, a provider-initiated push mode may be needed. 440 441This mode can be configured with the aid of the LDAP Backend 442({{SECT: Backends}} and {{slapd-ldap(8)}}). Instead of running the 443syncrepl engine on the actual consumer, a slapd-ldap proxy is set up 444near (or collocated with) the provider that points to the consumer, 445and the syncrepl engine runs on the proxy. 446 447For configuration, please see the {{SECT:Syncrepl Proxy}} section. 448 449H4: Replacing Slurpd 450 451The old {{slurpd}} mechanism only operated in provider-initiated 452push mode. Slurpd replication was deprecated in favor of Syncrepl 453replication and has been completely removed from OpenLDAP 2.4. 454 455The slurpd daemon was the original replication mechanism inherited from 456UMich's LDAP and operated in push mode: the master pushed changes to the 457slaves. It was replaced for many reasons, in brief: 458 459 * It was not reliable 460 ** It was extremely sensitive to the ordering of records in the replog 461 ** It could easily go out of sync, at which point manual intervention was 462 required to resync the slave database with the master directory 463 ** It wasn't very tolerant of unavailable servers. If a slave went down 464 for a long time, the replog could grow to a size that was too large for 465 slurpd to process 466 * It only worked in push mode 467 * It required stopping and restarting the master to add new slaves 468 * It only supported single master replication 469 470Syncrepl has none of those weaknesses: 471 472 * Syncrepl is self-synchronizing; you can start with a consumer database 473 in any state from totally empty to fully synced and it will automatically 474 do the right thing to achieve and maintain synchronization 475 ** It is completely insensitive to the order in which changes occur 476 ** It guarantees convergence between the consumer and the provider 477 content without manual intervention 478 ** It can resynchronize regardless of how long a consumer stays out 479 of contact with the provider 480 * Syncrepl can operate in either direction 481 * Consumers can be added at any time without touching anything on the 482 provider 483 * Multi-master replication is supported 484 485 486H2: Configuring the different replication types 487 488H3: Syncrepl 489 490H4: Syncrepl configuration 491 492Because syncrepl is a consumer-side replication engine, the syncrepl 493specification is defined in {{slapd.conf}}(5) of the consumer 494server, not in the provider server's configuration file. The initial 495loading of the replica content can be performed either by starting 496the syncrepl engine with no synchronization cookie or by populating 497the consumer replica by loading an {{TERM:LDIF}} file dumped as a 498backup at the provider. 499 500When loading from a backup, it is not required to perform the initial 501loading from the up-to-date backup of the provider content. The 502syncrepl engine will automatically synchronize the initial consumer 503replica to the current provider content. As a result, it is not 504required to stop the provider server in order to avoid the replica 505inconsistency caused by the updates to the provider content during 506the content backup and loading process. 507 508When replicating a large scale directory, especially in a bandwidth 509constrained environment, it is advised to load the consumer replica 510from a backup instead of performing a full initial load using 511syncrepl. 512 513 514H4: Set up the provider slapd 515 516The provider is implemented as an overlay, so the overlay itself 517must first be configured in {{slapd.conf}}(5) before it can be 518used. The provider has two primary configuration directives and 519two secondary directives for when delta-syncrepl is being used. 520Because the LDAP Sync search is subject to access control, proper 521access control privileges should be set up for the replicated 522content. 523 524The two primary options to configure are the checkpoint and 525sessionlog behaviors. 526 527The {{EX:contextCSN}} checkpoint is configured by the 528 529> syncprov-checkpoint <ops> <minutes> 530 531directive. Checkpoints are only tested after successful write 532operations. If {{<ops>}} operations or more than {{<minutes>}} 533time has passed since the last checkpoint, a new checkpoint is 534performed. Checkpointing is disabled by default. 535 536The session log is configured by the 537 538> syncprov-sessionlog <ops> 539 540directive, where {{<ops>}} is the maximum number of session log 541entries the session log can record. All write operations (except Adds) 542are recorded in the log. 543 544Note that using the session log requires searching on the {{entryUUID}} 545attribute. Setting an eq index on this attribute will greatly benefit 546the performance of the session log on the provider. 547 548The reloadhint option is configured by the 549 550> syncprov-reloadhint <TRUE|FALSE> 551 552directive. It must be set TRUE when using the accesslog overlay for 553delta-based syncrepl replication support. The default is FALSE. 554 555The nonpresent option should only be configured if the overlay is 556being placed on top of a log database, such as when used with 557delta-syncrepl. 558 559The nonpresent option is configured by the 560 561> syncprov-nopresent <TRUE|FALSE> 562 563directive. This value should only be set TRUE for a syncprov instance 564on top of a log database (such as one managed by the accesslog overlay). 565The default is FALSE. 566 567A more complete example of the {{slapd.conf}}(5) content is thus: 568 569> database mdb 570> maxsize 85899345920 571> suffix dc=example,dc=com 572> rootdn dc=example,dc=com 573> directory /var/ldap/db 574> index objectclass,entryCSN,entryUUID eq 575> 576> overlay syncprov 577> syncprov-checkpoint 100 10 578> syncprov-sessionlog 100 579 580 581H4: Set up the consumer slapd 582 583The syncrepl replication is specified in the database section of 584{{slapd.conf}}(5) for the replica context. The syncrepl engine 585is backend independent and the directive can be defined with any 586database type. 587 588> database mdb 589> maxsize 85899345920 590> suffix dc=example,dc=com 591> rootdn dc=example,dc=com 592> directory /var/ldap/db 593> index objectclass,entryCSN,entryUUID eq 594> 595> syncrepl rid=123 596> provider=ldap://provider.example.com:389 597> type=refreshOnly 598> interval=01:00:00:00 599> searchbase="dc=example,dc=com" 600> filter="(objectClass=organizationalPerson)" 601> scope=sub 602> attrs="cn,sn,ou,telephoneNumber,title,l" 603> schemachecking=off 604> bindmethod=simple 605> binddn="cn=syncuser,dc=example,dc=com" 606> credentials=secret 607 608In this example, the consumer will connect to the provider {{slapd}}(8) 609at port 389 of {{FILE:ldap://provider.example.com}} to perform a 610polling ({{refreshOnly}}) mode of synchronization once a day. It 611will bind as {{EX:cn=syncuser,dc=example,dc=com}} using simple 612authentication with password "secret". Note that the access control 613privilege of {{EX:cn=syncuser,dc=example,dc=com}} should be set 614appropriately in the provider to retrieve the desired replication 615content. Also the search limits must be high enough on the provider 616to allow the syncuser to retrieve a complete copy of the requested 617content. The consumer uses the rootdn to write to its database so 618it always has full permissions to write all content. 619 620The synchronization search in the above example will search for the 621entries whose objectClass is organizationalPerson in the entire 622subtree rooted at {{EX:dc=example,dc=com}}. The requested attributes 623are {{EX:cn}}, {{EX:sn}}, {{EX:ou}}, {{EX:telephoneNumber}}, 624{{EX:title}}, and {{EX:l}}. The schema checking is turned off, so 625that the consumer {{slapd}}(8) will not enforce entry schema 626checking when it processes updates from the provider {{slapd}}(8). 627 628For more detailed information on the syncrepl directive, see the 629{{SECT:syncrepl}} section of {{SECT:The slapd Configuration File}} 630chapter of this admin guide. 631 632 633H4: Start the provider and the consumer slapd 634 635The provider {{slapd}}(8) is not required to be restarted. 636{{contextCSN}} is automatically generated as needed: it might be 637originally contained in the {{TERM:LDIF}} file, generated by 638{{slapadd}} (8), generated upon changes in the context, or generated 639when the first LDAP Sync search arrives at the provider. If an 640LDIF file is being loaded which did not previously contain the 641{{contextCSN}}, the {{-w}} option should be used with {{slapadd}} 642(8) to cause it to be generated. This will allow the server to 643startup a little quicker the first time it runs. 644 645When starting a consumer {{slapd}}(8), it is possible to provide 646a synchronization cookie as the {{-c cookie}} command line option 647in order to start the synchronization from a specific state. The 648cookie is a comma separated list of name=value pairs. Currently 649supported syncrepl cookie fields are {{csn=<csn>}} and {{rid=<rid>}}. 650{{<csn>}} represents the current synchronization state of the 651consumer replica. {{<rid>}} identifies a consumer replica locally 652within the consumer server. It is used to relate the cookie to the 653syncrepl definition in {{slapd.conf}}(5) which has the matching 654replica identifier. The {{<rid>}} must have no more than 3 decimal 655digits. The command line cookie overrides the synchronization 656cookie stored in the consumer replica database. 657 658 659H3: Delta-syncrepl 660 661H4: Delta-syncrepl Provider configuration 662 663Setting up delta-syncrepl requires configuration changes on both the master and 664replica servers: 665 666> # Give the replica DN unlimited read access. This ACL needs to be 667> # merged with other ACL statements, and/or moved within the scope 668> # of a database. The "by * break" portion causes evaluation of 669> # subsequent rules. See slapd.access(5) for details. 670> access to * 671> by dn.base="cn=replicator,dc=example,dc=com" read 672> by * break 673> 674> # Set the module path location 675> modulepath /usr/lib/openldap 676> 677> # Load the mdb backend 678> moduleload back_mdb.la 679> 680> # Load the accesslog overlay 681> moduleload accesslog.la 682> 683> #Load the syncprov overlay 684> moduleload syncprov.la 685> 686> # Accesslog database definitions 687> database mdb 688> suffix cn=accesslog 689> rootdn cn=accesslog 690> directory /var/lib/db/accesslog 691> maxsize 85899345920 692> index default eq 693> index entryCSN,objectClass,reqEnd,reqResult,reqStart,reqDN 694> 695> overlay syncprov 696> syncprov-nopresent TRUE 697> syncprov-reloadhint TRUE 698> 699> # Let the replica DN have limitless searches 700> limits dn.exact="cn=replicator,dc=example,dc=com" time.soft=unlimited time.hard=unlimited size.soft=unlimited size.hard=unlimited 701> 702> # Primary database definitions 703> database mdb 704> suffix "dc=example,dc=com" 705> rootdn "cn=manager,dc=example,dc=com" 706> maxsize 85899345920 707> 708> ## Whatever other configuration options are desired 709> 710> # syncprov specific indexing 711> index entryCSN eq 712> index entryUUID eq 713> 714> # syncrepl Provider for primary db 715> overlay syncprov 716> syncprov-checkpoint 1000 60 717> 718> # accesslog overlay definitions for primary db 719> overlay accesslog 720> logdb cn=accesslog 721> logops writes 722> logsuccess TRUE 723> # scan the accesslog DB every day, and purge entries older than 7 days 724> logpurge 07+00:00 01+00:00 725> 726> # Let the replica DN have limitless searches 727> limits dn.exact="cn=replicator,dc=example,dc=com" time.soft=unlimited time.hard=unlimited size.soft=unlimited size.hard=unlimited 728 729For more information, always consult the relevant man pages ({{slapo-accesslog}}(5) and {{slapd.conf}}(5)) 730 731 732H4: Delta-syncrepl Consumer configuration 733 734> # Replica database configuration 735> database mdb 736> suffix "dc=example,dc=com" 737> rootdn "cn=manager,dc=example,dc=com" 738> maxsize 85899345920 739> 740> ## Whatever other configuration bits for the replica, like indexing 741> ## that you want 742> 743> # syncrepl specific indices 744> index entryUUID eq 745> 746> # syncrepl directives 747> syncrepl rid=0 748> provider=ldap://ldapmaster.example.com:389 749> bindmethod=simple 750> binddn="cn=replicator,dc=example,dc=com" 751> credentials=secret 752> searchbase="dc=example,dc=com" 753> logbase="cn=accesslog" 754> logfilter="(&(objectClass=auditWriteObject)(reqResult=0))" 755> schemachecking=on 756> type=refreshAndPersist 757> retry="60 +" 758> syncdata=accesslog 759> 760> # Refer updates to the master 761> updateref ldap://ldapmaster.example.com 762 763 764The above configuration assumes that you have a replicator identity defined 765in your database that can be used to bind to the provider. In addition, 766all of the databases (primary, replica, and the accesslog 767storage database) should also have properly tuned {{DB_CONFIG}} files that meet 768your needs. 769 770Note: An accesslog database is unique to a given master. It should 771never be replicated. 772 773H3: N-Way Multi-Master 774 775For the following example we will be using 3 Master nodes. Keeping in line with 776{{B:test050-syncrepl-multimaster}} of the OpenLDAP test suite, we will be configuring 777{{slapd(8)}} via {{B:cn=config}} 778 779This sets up the config database: 780 781> dn: cn=config 782> objectClass: olcGlobal 783> cn: config 784> olcServerID: 1 785> 786> dn: olcDatabase={0}config,cn=config 787> objectClass: olcDatabaseConfig 788> olcDatabase: {0}config 789> olcRootPW: secret 790 791second and third servers will have a different olcServerID obviously: 792 793> dn: cn=config 794> objectClass: olcGlobal 795> cn: config 796> olcServerID: 2 797> 798> dn: olcDatabase={0}config,cn=config 799> objectClass: olcDatabaseConfig 800> olcDatabase: {0}config 801> olcRootPW: secret 802 803This sets up syncrepl as a provider (since these are all masters): 804 805> dn: cn=module,cn=config 806> objectClass: olcModuleList 807> cn: module 808> olcModulePath: /usr/local/libexec/openldap 809> olcModuleLoad: syncprov.la 810 811Now we setup the first Master Node (replace $URI1, $URI2 and $URI3 etc. with your actual ldap urls): 812 813> dn: cn=config 814> changetype: modify 815> replace: olcServerID 816> olcServerID: 1 $URI1 817> olcServerID: 2 $URI2 818> olcServerID: 3 $URI3 819> 820> dn: olcOverlay=syncprov,olcDatabase={0}config,cn=config 821> changetype: add 822> objectClass: olcOverlayConfig 823> objectClass: olcSyncProvConfig 824> olcOverlay: syncprov 825> 826> dn: olcDatabase={0}config,cn=config 827> changetype: modify 828> add: olcSyncRepl 829> olcSyncRepl: rid=001 provider=$URI1 binddn="cn=config" bindmethod=simple 830> credentials=secret searchbase="cn=config" type=refreshAndPersist 831> retry="5 5 300 5" timeout=1 832> olcSyncRepl: rid=002 provider=$URI2 binddn="cn=config" bindmethod=simple 833> credentials=secret searchbase="cn=config" type=refreshAndPersist 834> retry="5 5 300 5" timeout=1 835> olcSyncRepl: rid=003 provider=$URI3 binddn="cn=config" bindmethod=simple 836> credentials=secret searchbase="cn=config" type=refreshAndPersist 837> retry="5 5 300 5" timeout=1 838> - 839> add: olcMirrorMode 840> olcMirrorMode: TRUE 841 842Now start up the Master and a consumer/s, also add the above LDIF to the first consumer, second consumer etc. It will then replicate {{B:cn=config}}. You now have N-Way Multimaster on the config database. 843 844We still have to replicate the actual data, not just the config, so add to the master (all active and configured consumers/masters will pull down this config, as they are all syncing). Also, replace all {{${}}} variables with whatever is applicable to your setup: 845 846> dn: olcDatabase={1}$BACKEND,cn=config 847> objectClass: olcDatabaseConfig 848> objectClass: olc${BACKEND}Config 849> olcDatabase: {1}$BACKEND 850> olcSuffix: $BASEDN 851> olcDbDirectory: ./db 852> olcRootDN: $MANAGERDN 853> olcRootPW: $PASSWD 854> olcLimits: dn.exact="$MANAGERDN" time.soft=unlimited time.hard=unlimited size.soft=unlimited size.hard=unlimited 855> olcSyncRepl: rid=004 provider=$URI1 binddn="$MANAGERDN" bindmethod=simple 856> credentials=$PASSWD searchbase="$BASEDN" type=refreshOnly 857> interval=00:00:00:10 retry="5 5 300 5" timeout=1 858> olcSyncRepl: rid=005 provider=$URI2 binddn="$MANAGERDN" bindmethod=simple 859> credentials=$PASSWD searchbase="$BASEDN" type=refreshOnly 860> interval=00:00:00:10 retry="5 5 300 5" timeout=1 861> olcSyncRepl: rid=006 provider=$URI3 binddn="$MANAGERDN" bindmethod=simple 862> credentials=$PASSWD searchbase="$BASEDN" type=refreshOnly 863> interval=00:00:00:10 retry="5 5 300 5" timeout=1 864> olcMirrorMode: TRUE 865> 866> dn: olcOverlay=syncprov,olcDatabase={1}${BACKEND},cn=config 867> changetype: add 868> objectClass: olcOverlayConfig 869> objectClass: olcSyncProvConfig 870> olcOverlay: syncprov 871 872Note: All of your servers' clocks must be tightly synchronized using 873e.g. NTP {{http://www.ntp.org/}}, atomic clock, or some other reliable 874time reference. 875 876Note: As stated in {{slapd-config}}(5), URLs specified in {{olcSyncRepl}} 877directives are the URLs of the servers from which to replicate. These 878must exactly match the URLs {{slapd}} listens on ({{-h}} in {{SECT:Command-Line Options}}). 879Otherwise slapd may attempt to replicate from itself, causing a loop. 880 881H3: MirrorMode 882 883MirrorMode configuration is actually very easy. If you have ever setup a normal 884slapd syncrepl provider, then the only change is the following two directives: 885 886> mirrormode on 887> serverID 1 888 889Note: You need to make sure that the {{serverID}} of each mirror node is 890different and add it as a global configuration option. 891 892H4: Mirror Node Configuration 893 894The first step is to configure the syncrepl provider the same as in the 895{{SECT:Set up the provider slapd}} section. 896 897Here's a specific cut down example using {{SECT:LDAP Sync Replication}} in 898{{refreshAndPersist}} mode: 899 900MirrorMode node 1: 901 902> # Global section 903> serverID 1 904> # database section 905> 906> # syncrepl directive 907> syncrepl rid=001 908> provider=ldap://ldap-sid2.example.com 909> bindmethod=simple 910> binddn="cn=mirrormode,dc=example,dc=com" 911> credentials=mirrormode 912> searchbase="dc=example,dc=com" 913> schemachecking=on 914> type=refreshAndPersist 915> retry="60 +" 916> 917> mirrormode on 918 919MirrorMode node 2: 920 921> # Global section 922> serverID 2 923> # database section 924> 925> # syncrepl directive 926> syncrepl rid=001 927> provider=ldap://ldap-sid1.example.com 928> bindmethod=simple 929> binddn="cn=mirrormode,dc=example,dc=com" 930> credentials=mirrormode 931> searchbase="dc=example,dc=com" 932> schemachecking=on 933> type=refreshAndPersist 934> retry="60 +" 935> 936> mirrormode on 937 938It's simple really; each MirrorMode node is setup {{B:exactly}} the same, except 939that the {{serverID}} is unique, and each consumer is pointed to 940the other server. 941 942H5: Failover Configuration 943 944There are generally 2 choices for this; 1. Hardware proxies/load-balancing or 945dedicated proxy software, 2. using a Back-LDAP proxy as a syncrepl provider 946 947A typical enterprise example might be: 948 949!import "dual_dc.png"; align="center"; title="MirrorMode Enterprise Configuration" 950FT[align="Center"] Figure X.Y: MirrorMode in a Dual Data Center Configuration 951 952H5: Normal Consumer Configuration 953 954This is exactly the same as the {{SECT:Set up the consumer slapd}} section. It 955can either setup in normal {{SECT:syncrepl replication}} mode, or in 956{{SECT:delta-syncrepl replication}} mode. 957 958H4: MirrorMode Summary 959 960You will now have a directory architecture that provides all of the 961consistency guarantees of single-master replication, while also providing the 962high availability of multi-master replication. 963 964 965H3: Syncrepl Proxy 966 967!import "push-based-complete.png"; align="center"; title="Syncrepl Proxy Mode" 968FT[align="Center"] Figure X.Y: Replacing slurpd 969 970The following example is for a self-contained push-based replication solution: 971 972> ####################################################################### 973> # Standard OpenLDAP Master/Provider 974> ####################################################################### 975> 976> include /usr/local/etc/openldap/schema/core.schema 977> include /usr/local/etc/openldap/schema/cosine.schema 978> include /usr/local/etc/openldap/schema/nis.schema 979> include /usr/local/etc/openldap/schema/inetorgperson.schema 980> 981> include /usr/local/etc/openldap/slapd.acl 982> 983> modulepath /usr/local/libexec/openldap 984> moduleload back_mdb.la 985> moduleload syncprov.la 986> moduleload back_monitor.la 987> moduleload back_ldap.la 988> 989> pidfile /usr/local/var/slapd.pid 990> argsfile /usr/local/var/slapd.args 991> 992> loglevel sync stats 993> 994> database mdb 995> suffix "dc=suretecsystems,dc=com" 996> directory /usr/local/var/openldap-data 997> maxsize 85899345920 998> 999> checkpoint 1024 5 1000> 1001> index objectClass eq 1002> # rest of indexes 1003> index default sub 1004> 1005> rootdn "cn=admin,dc=suretecsystems,dc=com" 1006> rootpw testing 1007> 1008> # syncprov specific indexing 1009> index entryCSN eq 1010> index entryUUID eq 1011> 1012> # syncrepl Provider for primary db 1013> overlay syncprov 1014> syncprov-checkpoint 1000 60 1015> 1016> # Let the replica DN have limitless searches 1017> limits dn.exact="cn=replicator,dc=suretecsystems,dc=com" time.soft=unlimited time.hard=unlimited size.soft=unlimited size.hard=unlimited 1018> 1019> database monitor 1020> 1021> database config 1022> rootpw testing 1023> 1024> ############################################################################## 1025> # Consumer Proxy that pulls in data via Syncrepl and pushes out via slapd-ldap 1026> ############################################################################## 1027> 1028> database ldap 1029> # ignore conflicts with other databases, as we need to push out to same suffix 1030> hidden on 1031> suffix "dc=suretecsystems,dc=com" 1032> rootdn "cn=slapd-ldap" 1033> uri ldap://localhost:9012/ 1034> 1035> lastmod on 1036> 1037> # We don't need any access to this DSA 1038> restrict all 1039> 1040> acl-bind bindmethod=simple 1041> binddn="cn=replicator,dc=suretecsystems,dc=com" 1042> credentials=testing 1043> 1044> syncrepl rid=001 1045> provider=ldap://localhost:9011/ 1046> binddn="cn=replicator,dc=suretecsystems,dc=com" 1047> bindmethod=simple 1048> credentials=testing 1049> searchbase="dc=suretecsystems,dc=com" 1050> type=refreshAndPersist 1051> retry="5 5 300 5" 1052> 1053> overlay syncprov 1054 1055A replica configuration for this type of setup could be: 1056 1057> ####################################################################### 1058> # Standard OpenLDAP Slave without Syncrepl 1059> ####################################################################### 1060> 1061> include /usr/local/etc/openldap/schema/core.schema 1062> include /usr/local/etc/openldap/schema/cosine.schema 1063> include /usr/local/etc/openldap/schema/nis.schema 1064> include /usr/local/etc/openldap/schema/inetorgperson.schema 1065> 1066> include /usr/local/etc/openldap/slapd.acl 1067> 1068> modulepath /usr/local/libexec/openldap 1069> moduleload back_mdb.la 1070> moduleload syncprov.la 1071> moduleload back_monitor.la 1072> moduleload back_ldap.la 1073> 1074> pidfile /usr/local/var/slapd.pid 1075> argsfile /usr/local/var/slapd.args 1076> 1077> loglevel sync stats 1078> 1079> database mdb 1080> suffix "dc=suretecsystems,dc=com" 1081> directory /usr/local/var/openldap-slave/data 1082> 1083> maxsize 85899345920 1084> checkpoint 1024 5 1085> 1086> index objectClass eq 1087> # rest of indexes 1088> index default sub 1089> 1090> rootdn "cn=admin,dc=suretecsystems,dc=com" 1091> rootpw testing 1092> 1093> # Let the replica DN have limitless searches 1094> limits dn.exact="cn=replicator,dc=suretecsystems,dc=com" time.soft=unlimited time.hard=unlimited size.soft=unlimited size.hard=unlimited 1095> 1096> updatedn "cn=replicator,dc=suretecsystems,dc=com" 1097> 1098> # Refer updates to the master 1099> updateref ldap://localhost:9011 1100> 1101> database monitor 1102> 1103> database config 1104> rootpw testing 1105 1106You can see we use the {{updatedn}} directive here and example ACLs ({{F:usr/local/etc/openldap/slapd.acl}}) for this could be: 1107 1108> # Give the replica DN unlimited read access. This ACL may need to be 1109> # merged with other ACL statements. 1110> 1111> access to * 1112> by dn.base="cn=replicator,dc=suretecsystems,dc=com" write 1113> by * break 1114> 1115> access to dn.base="" 1116> by * read 1117> 1118> access to dn.base="cn=Subschema" 1119> by * read 1120> 1121> access to dn.subtree="cn=Monitor" 1122> by dn.exact="uid=admin,dc=suretecsystems,dc=com" write 1123> by users read 1124> by * none 1125> 1126> access to * 1127> by self write 1128> by * read 1129 1130In order to support more replicas, just add more {{database ldap}} sections and 1131increment the {{syncrepl rid}} number accordingly. 1132 1133Note: You must populate the Master and Slave directories with the same data, 1134unlike when using normal Syncrepl 1135 1136If you do not have access to modify the master directory configuration you can 1137configure a standalone ldap proxy, which might look like: 1138 1139!import "push-based-standalone.png"; align="center"; title="Syncrepl Standalone Proxy Mode" 1140FT[align="Center"] Figure X.Y: Replacing slurpd with a standalone version 1141 1142The following configuration is an example of a standalone LDAP Proxy: 1143 1144> include /usr/local/etc/openldap/schema/core.schema 1145> include /usr/local/etc/openldap/schema/cosine.schema 1146> include /usr/local/etc/openldap/schema/nis.schema 1147> include /usr/local/etc/openldap/schema/inetorgperson.schema 1148> 1149> include /usr/local/etc/openldap/slapd.acl 1150> 1151> modulepath /usr/local/libexec/openldap 1152> moduleload syncprov.la 1153> moduleload back_ldap.la 1154> 1155> ############################################################################## 1156> # Consumer Proxy that pulls in data via Syncrepl and pushes out via slapd-ldap 1157> ############################################################################## 1158> 1159> database ldap 1160> # ignore conflicts with other databases, as we need to push out to same suffix 1161> hidden on 1162> suffix "dc=suretecsystems,dc=com" 1163> rootdn "cn=slapd-ldap" 1164> uri ldap://localhost:9012/ 1165> 1166> lastmod on 1167> 1168> # We don't need any access to this DSA 1169> restrict all 1170> 1171> acl-bind bindmethod=simple 1172> binddn="cn=replicator,dc=suretecsystems,dc=com" 1173> credentials=testing 1174> 1175> syncrepl rid=001 1176> provider=ldap://localhost:9011/ 1177> binddn="cn=replicator,dc=suretecsystems,dc=com" 1178> bindmethod=simple 1179> credentials=testing 1180> searchbase="dc=suretecsystems,dc=com" 1181> type=refreshAndPersist 1182> retry="5 5 300 5" 1183> 1184> overlay syncprov 1185 1186As you can see, you can let your imagination go wild using Syncrepl and 1187{{slapd-ldap(8)}} tailoring your replication to fit your specific network 1188topology. 1189