| Scroll Ignore |
|---|
| scroll-pdf | true |
|---|
| scroll-office | true |
|---|
| scroll-chm | true |
|---|
| scroll-docbook | true |
|---|
| scroll-eclipsehelp | true |
|---|
| scroll-epub | true |
|---|
| scroll-html | true |
|---|
|
|
Les logs de la rétention Mongodb du Scheduler sont classés par catégorie afin de pouvoir différencier les types de log :
- Sauvegarde
- Chargement
- La suppression des lignes retentions obsolètes.
Sur réception du signal SIGUSR1 le module va effectuer un dump de sa mémoire, pour tout autre signal, le module va s'éteindre
| Code Block |
|---|
[2021-04-21 10:24:49] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] The worker with the pid XXXX received a signal XX |
Quand le processus de pilotage s'arrête de façon inopinée
| Code Block |
|---|
[2021-04-21 10:24:49] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] I am a worker with pid: XXXX and my master process YYYY is dead, I exit. |
Demande d'un dump de la mémoire
| Code Block |
|---|
[2021-04-21 10:24:49] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] (support-only) MEMORY DUMP (to be sent to the support):
xxxxxxxx
xxxxxxxx
xxxxxxxx |
| Code Block |
|---|
[2021-04-21 10:24:49] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] (support-only) Memory information dumped to file FFFFFFF (to be sent to the support) |
| Code Block |
|---|
[2021-04-21 10:24:49] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] MEMORY DUMP: FAIL check if guppy lib is installed |
| Code Block |
|---|
[2021-04-21 10:24:49] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] (support-only) MEMORY DUMP: FAIL check if meliae lib is installed |
Connexion à la base de données
| Code Block |
|---|
[2021-04-21 10:24:49] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] We are creating mongo connection [uri=mongodb://192.168.1.120/?safe=false] [database=shinken] [ssh=True]
[2021-04-21 10:24:49] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] Connection created in : 0.200s
|
| Code Block |
|---|
[2021-04-21 10:24:49] WARNING: [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] Mongo connection failed 1/X time, we will try again
[2021-04-21 10:24:49] WARNING: [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] Mongo connection failed Y/X times, we will try again
[2021-04-21 10:24:49] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] Mongo connection failed X/X times, we stop trying |
La connexion a été perdue ou n'existe pas
| Code Block |
|---|
[2021-04-21 10:24:49] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] We need to create a mongo connection |
suivi des logs de la connexion normale
La connexion n'a pas pu être établie
| Code Block |
|---|
[2021-04-21 10:24:49] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] Could not create mongo connection |
Erreur de configuration du module
Erreurs pendant la suppression d'anciennes données ou lors de la sauvegarde
Si une erreur survient pendant une opération en base de données, les logs suivants vont apparaître :
| Code Block |
|---|
|
[2019-07-10 16:19:10] WARNING: [SCHEDULERNAME] [ MongodbRetention |
Si plusieurs url mongo sont précisé
| Code Block |
|---|
[2021-04-20 13:52:26] ERROR : [ SCHEDULERNAME ] [ MODULESSOUS-MANAGER SECTION ] TheWe instancehave MongodbRetentionbeen raiseddisconnected anof error:mongo. MultiplesWill urls were found in the module's configuration file. I disable it and set it to restart it later |
Pour la sauvegarde de la rétention, trois types de logs existent:
retry [1/3]
[2019-07-10 16:19:10] WARNING: [SCHEDULERNAME] [ MongodbRetention ] [ SOUS-SECTION ] We have been disconnected of mongo. Will retry [2/3]
[2019-07-10 16:19:10] WARNING: [SCHEDULERNAME] [ MongodbRetention ] [ SOUS-SECTION ] We have been disconnected of mongo. Will retry [3/3]
[2019-07-10 16:19:10] ERROR : [SCHEDULERNAME] [ MongodbRetention ] [ SOUS-SECTION ] After 3 tries, we couldn't connect to mongo
|
| Section | Description |
|---|
| SAVE GLOBAL | Correspond au processus global de la sauvegarde |
| SAVE WORKERS | Corresponds à un sous-processus de SAVE GLOBAL, qui s'occupe de la file d'attente des différents workers de la sauvegarde |
| SAVE WORKER X | C'est un sous-processus de SAVE WORKERS, correspondant à un worker numéroté X qui permet de sauvegarder une partie des informations du Scheduler en base. Le nombre de workers est paramétrable dans les paramètres du module. ( voir Rétention en base de donnée centralisée par royaume ( Module MongodbRetention ) ) |
Les logs SAVE GLOBAL donnent des informations relatives au fonctionnement global du module ou de sa configuration.
| Code Block |
|---|
|
[2019-07-10 1416:3419:3910] INFOERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVESOUS-SECTION GLOBAL] We have an error:[ERROR MESSAGE]
[2019-07-10 16:19:10] StartingERROR to save: retention data. [XXX:hosts[SCHEDULERNAME] [XXX:checks] (Database used = mongodb://HOST/?safe=false, use ssh = False MongodbRetention ] [ SOUS-SECTION ] (stack du Traceback)
[2019-07-10 1416:3419:3910] INFO ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE GLOBAL ] SUCCESS Retention data was saved into mongodb. Total time X.XXs |
Erreur de configuration du module
Si plusieurs url mongo sont préciséLes erreurs lors de la sauvegarde de la rétention sont aussi enregistrées dans les logs sous cette forme:
| Code Block |
|---|
[2021-04-20 1113:52:26:57] ERROR : [ SCHEDULERNAME ] [ MODULES-MANAGER ] The instance MongodbRetention raised an error: ERRORMultiples MESSAGE.urls Totalwere time XX.XXsfound in the module's configuration file. I disable it and set it to restart it later
|
| Code Block |
|---|
|
[2021-04-20 11:26:57] ERROR : [ SCHEDULERNAME ] [ MODULES-MANAGER ] The instance MongodbRetention raised an error: [ SAVE GLOBAL ] FAILED Retention data could not be saved in mongodb. Total time 22.20s. I disable it and set it to restart it later
[2021-04-20 11:26:57] ERROR : [ SCHEDULERNAME ] [ MODULES-MANAGER ] The instance MongodbRetention raised an error: [ SAVE GLOBAL ] FAILED Retention data could not be saved in mongodb because mongo is unreachable. Total time 2.11s. I disable it and set it to restart it later |
Pour la sauvegarde de la rétention, trois types de logs existent:
| Section | Description |
|---|
| SAVE GLOBAL | Correspond au processus global de la sauvegarde |
| SAVE WORKERS | Corresponds à un sous-processus de SAVE GLOBAL, qui s'occupe de la file d'attente des différents workers de la sauvegarde |
| SAVE WORKER X | C'est un sous-processus de SAVE WORKERS, correspondant à un worker numéroté X qui permet de sauvegarder une partie des informations du Scheduler en base. Le nombre de workers est paramétrable dans les paramètres du module. ( voir Rétention en base de donnée centralisée par royaume ( Module MongodbRetention ) ) |
Les logs SAVE GLOBAL donnent des informations relatives au fonctionnement global du module ou de sa configuration.
| Code Block |
|---|
|
[2019-07-10 14:34:39] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE GLOBAL ] Starting to save retention data. [XXX:hosts] [XXX:checks] (Database used = mongodb://HOST/?safe=false, use ssh = False)
|
Les logs SAVE WORKERS donnent l'état de chaque worker de sa création à son succès/échec.
| Code Block |
|---|
|
[2019-07-10 14:34:4439] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKERSGLOBAL ] ] Starting worker X with pid XXXXX. Try: X/X
[2019-07-10 14:34:54] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKERS ] The worker X did SUCCESS (after X try) |
| Code Block |
|---|
SUCCESS Retention data was saved into mongodb. Total time X.XXs |
Les erreurs lors de la sauvegarde de la rétention sont aussi enregistrées dans les logs sous cette forme:
| Code Block |
|---|
[2021-04-20 11:26:57] ERROR : [ SCHEDULERNAME ] [ MODULES-MANAGER ] The instance MongodbRetention raised an error: ERROR MESSAGE. Total time XX.XXs. I disable it and set it to restart it later
|
| Code Block |
|---|
|
[2021-04-20 11:26:57] ERROR [2019-07-10 14:34:44] WARNING: [ SCHEDULERNAME ] [ MODULES-MANAGER ] The instance MongodbRetention ] raised an error: [ SAVE PERFGLOBAL ] [ X.XXXs ] atomization durationFAILED Retention data could not be saved in mongodb. Total time 22.20s. I disable it and set it to restart it later
[2021-04-20 11:26:57] ERROR : [ SCHEDULERNAME ] [ MODULES-MANAGER ] The instance MongodbRetention raised an error: [ SAVE GLOBAL ] FAILED Retention data could not be saved in mongodb because mongo is unreachable. Total time 2.11s. I disable it and set it to restart it later |
Les logs SAVE WORKERS donnent l'état de chaque worker de sa création à son succès/échec.
| Code Block |
|---|
|
| Code Block |
|---|
[2019-07-10 14:34:44] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKERS ] some workers did fail to exit or encountered an error. The retention save can be incomplete.
[2019-07-10 14:34:44] ERROR : Too many tries failed
[2019-07-10 14:34:44] ERROR : Cannot start the XXXXX worker process as there is not enough memory
[2019-07-10 14:34:44] INFO ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKERS ] Cannot start the worker X process: XX. Exiting the retention save, killing all currently] launched workers |
Les logs SAVE WORKER X donne pour le worker ayant l'identifiant X, les statistiques sur les sauvegardes qu'il a effectuées : le nombre d'éléments, résultat et temps d'exécution.
| Code Block |
|---|
|
Starting worker X with pid XXXXX. Try: X/X
[2019-07-10 14:34:4454] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKERWORKERS 0 ] Updating retention with elements: checks] [The XXXworker ]X --did hostsSUCCESS [(after XX ] in mongodb
X try) |
La préparation des données à sauvegarder a été longue :
| Code Block |
|---|
[2019-07-10 14:34:44] INFO WARNING: [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVEPERF WORKER] 0 [ X.XXXs ] Retention data saved into mongodb in X.XXX seconds |
Erreurs
Des erreurs empêchent le bon déroulé de la sauvegarde :
| Code Block |
|---|
[2019-07-10 14:34:44] INFO WARNING: [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKERS WORKER X ] The worker (pid:XXXX | try:XX) did not ] some workers did fail to exit onor timeencountered (XXan s)error. We are restarting it. |
| Code Block |
|---|
[2021-04-20 12:06:15] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKER X ] Failed connection with the following message : ERROR MESSAGE |
Chargement de la rétention
Les logs fournissent des informations liées au chargement de la rétention, permettant de suivre son avancée et l'état sur la connexion à Mongo.
| Code Block |
|---|
The retention save can be incomplete.
[2019-07-10 14:34:44] ERROR : Too many tries failed
[2019-07-10 14:34:44] ERROR : Cannot start the XXXXX worker process as there is not enough memory
[2019-07-10 1614:1934:1044] INFO ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOAD RETENTION ] [ HOSTS / CLUSTERS ] [ X.XXXs ] We took X hosts/clusters fromSAVE WORKERS ] Cannot start the worker X process: XX. Exiting the retention [save, inkilling schedulerall hosts/clusterscurrently launched workers |
Les logs SAVE WORKER X donne pour le worker ayant l'identifiant X, les statistiques sur les sauvegardes qu'il a effectuées : le nombre d'éléments, résultat et temps d'exécution.
| Code Block |
|---|
|
: without retention=X / total=1 ]
[2019-07-10 1614:1934:1044] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE LOADWORKER RETENTION0 ] [ HOSTS / CLUSTERS ] No host/cluster are needed] forUpdating retention with loadelements: (schedulerchecks already[ haveXXX all] X-- hosts [ XX hosts] retentionin data).mongodb
[2019-07-10 1614:1934:1044] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOADSAVE RETENTIONWORKER ]0 [ CHECKS ] Retention ]data [saved X.XXXsinto ]mongodb Wein took X.XXX seconds |
| Code Block |
|---|
[2019-07-10 14:34:44] WARNING: checks[ SCHEDULERNAME ] [ MongodbRetention ] [ SAVE WORKER fromX the retention [ in scheduler checks ] The worker (pid:XXXX | try:XX) withoutdid retention=XXnot exit on /time total=(XX s). We are restarting it. |
| Code Block |
|---|
[2021-04-20 12:06:15] ERROR ]
[2019-07-10 16:19:10] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOADSAVE RETENTION ] [ CHECKS WORKER X ] NoFailed checksconnection with the following message : ERROR MESSAGE |
Chargement de la rétention
Les logs fournissent des informations liées au chargement de la rétention, permettant de suivre son avancée et l'état sur la connexion à Mongo.
| Code Block |
|---|
are needed for retention load (scheduler already have all X checks retention data).
[2019-07-10 16:19:10] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOAD RETENTION ] [ HOSTS / CLUSTERS ] [ X.XXXs ] We took TotalX number of elementshosts/clusters load from mongo database: X ( scheduler have a total of XX elements ) the retention [ in scheduler hosts/clusters : without retention=X / total=1 ]
[2019-07-10 1416:3519:3710] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOAD RETENTION ] [ X.XXXs ] SUCCESS Retention data loaded successfully. |
Les erreurs lors du chargement de la rétention sont aussi enregistrées dans les logs sous cette forme:
| Code Block |
|---|
HOSTS / CLUSTERS ] No host/cluster are needed for retention load (scheduler already have all X hosts retention data).
[2019-07-10 16:19:10] INFO ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOAD RETENTION ] FAILED[ RetentionCHECKS data could not be loaded from mongodb: ERROR MESSAGE DETAILS
[2019-07-10 16:19:10] ERROR : [ SCHEDULERNAME ] [ MongodbRetentionX.XXXs ] We [took LOADX RETENTION ]checks error querying host entries: ERROR MESSAGE. Module exiting.
[2019-07-10 16:19:10] ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOAD RETENTION ] error querying checks entries: ERROR MESSAGE. Module exiting. |
Suppression des anciennes rétentions
Les logs de suppression permettent de voir le nombre d'objets supprimés (triés par hôtes et checks) ainsi que la date à partir de laquelle la rétention est conservée.
| Code Block |
|---|
| title | Exemple avec des objets à supprimer |
|---|
|
from the retention [ in scheduler checks : without retention=XX / total=XX ]
[2019-07-10 1516:5419:5310] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETE OLD RETENTION LOAD RETENTION ] [ CHECKS ] We will delete all retention data that were saved before the XXXX-XX-XX XX:XX UTC (X days)No checks are needed for retention load (scheduler already have all X checks retention data).
[2019-07-10 1516:5419:5310] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETE OLDLOAD RETENTION ] [ X.XXXs] -Total number Deletingof XXXelements hostsload from old retention [XXXX by XXXX] mongo database: X ( scheduler have a total of XX elements )
[2019-07-10 1514:5435:5337] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETELOAD OLD RETENTION ] [ -X.XXXs XXX] SUCCESS -Retention hostsdata deletedloaded in X.XXXs
successfully. |
Les erreurs lors du chargement de la rétention sont aussi enregistrées dans les logs sous cette forme:
| Code Block |
|---|
[2019-07-10 1516:5419:5310] INFO ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ LOAD DELETERETENTION OLD RETENTION ] FAILED ]Retention data -could Deletingnot XXXbe servicesloaded from oldmongodb: retentionERROR [XXXXMESSAGE DETAILS
|
| Code Block |
|---|
by XXXX]
[2019-07-10 1516:5419:5310] INFOERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETELOAD OLD RETENTION ] error -querying XXXhost entries: -ERROR servicesMESSAGE. deletedModule exiting. |
| Code Block |
|---|
in X.XXXs
[2019-07-10 1516:5419:5310] INFOERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETE OLDLOAD RETENTION ] Totalerror timequerying forchecks deletingentries: XXXXERROR entriesMESSAGE. = X.XXXs
Module exiting. |
Suppression des anciennes rétentions
Les logs de suppression permettent de voir le nombre d'objets supprimés (triés par hôtes et checks) ainsi que la date à partir de laquelle la rétention est conservée.
| Code Block |
|---|
| title | Exemple sans avec des objets à supprimer |
|---|
|
[2019-07-10 1415:3554:1353] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETE OLD RETENTION ] We will delete all retention data that were saved before the XXXX-XX-XX XX:XX UTC (X days)
[2019-07-10 14:35:16] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETE OLD RETENTION ] There is no data to delete
[2019-07-10 14:35:1615:54:53] INFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ DELETE OLD RETENTION ] Total time for deleting 0 entries = X.XXXs
|
Erreurs lors de la suppression d'anciennes données ou lors de la sauvegarde
Si une erreur est rencontrée lors de la suppression, elles seront indiquées dans les logs, comme ceux-ci:
| Code Block |
|---|
|
- Deleting XXX hosts from old retention [XXXX by XXXX]
[2019-07-10 1615:1954:1053] INFO WARNING : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION DELETE OLD RETENTION ] We have- beenXXX disconnected of- mongo.hosts Willdeleted retry [1/3]in X.XXXs
[2019-07-10 1615:1954:1053] INFO WARNING: [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTIONDELETE OLD RETENTION ] We have- beenDeleting disconnectedXXX ofservices mongo.from Willold retryretention [2/3XXXX by XXXX]
[2019-07-10 1615:1954:1053] WARNINGINFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION DELETE OLD RETENTION ] We have- beenXXX disconnected of- mongo.services Willdeleted retry [3/3]in X.XXXs
[2019-07-10 1615:1954:1053] ERRORINFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTIONDELETE OLD RETENTION ] AfterTotal 3time tries,for wedeleting couldn'tXXXX connectentries to= mongoX.XXXs
|
| Code Block |
|---|
| title | Exemple sans objets à supprimer |
|---|
|
[2019-07-10 1614:1935:1013] ERRORINFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTIONDELETE OLD RETENTION ] We have an error:[ERROR MESSAGE]will delete all retention data that were saved before the XXXX-XX-XX XX:XX UTC (X days)
[2019-07-10 1614:1935:1016] INFO ERROR : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTION ] (stack du Traceback)DELETE OLD RETENTION ] There is no data to delete
[2019-07-10 1614:1935:1016] ERRORINFO : [ SCHEDULERNAME ] [ MongodbRetention ] [ SOUS-SECTIONDELETE OLD RETENTION ] Total ...time for deleting 0 entries = X.XXXs
|