34 add propper logging #36

cwmeijer · 2024-12-18T13:38:36Z

(fixes #34)
see this pr's edits to the readme.md for an overview of the added functionality

rogerkuou

Hi @cwmeijer, thanks for the implementation! Most of thing look awesome.
I have one major concern about the performance of log_array. The others comments are all minor.

rogerkuou · 2024-12-19T10:43:07Z

src/segmentmytiff/logging_config.py

+import pandas as pd
+
+
+def setup_logger(name: str = None, level=logging.INFO):


I see the default level here is is only used for stdout_sh. When looking at the API, my intuitive feeling would be this indicates the default level of the output logger root_logger. I suggest either rename this to stdout_level, or pass this variable to the default level of root_logger. (I prefer the second)

My idea was to have logging to files for debug and info always on, and only have the level of stdout configurable. So in that sense, the second options isn't possible. I think the first option is an excellent suggestion.

rogerkuou · 2024-12-19T10:44:20Z

src/segmentmytiff/logging_config.py

+
+    # Configure logger and add handlers
+    root_logger = logging.getLogger()
+    root_logger.setLevel(logging.DEBUG)


Following the suggestion on level, proposing this.

Suggested change

root_logger.setLevel(logging.DEBUG)

root_logger.setLevel(level)

rogerkuou · 2024-12-19T11:00:47Z

src/segmentmytiff/logging_config.py

+    path = Path('log')
+    path.mkdir(exist_ok=True, parents=True)


Consider expose the path of log file as an option?

I thought about that, but I think that adds unnecessary complexity for now. If you think we need it (now or soon), especially from the plugin side, I'll go for it of course.

rogerkuou · 2024-12-19T11:14:30Z

src/segmentmytiff/logging_config.py

+    logger.info(f"{task_name} finished in {duration:.4f} seconds")
+
+
+def log_array(data: np.ndarray, logger, array_name:str="array") -> None:


In this function, despite the level of logger, all calculations be called. In case a large data, this will significantly slow down you workflow.

Maybe we should consider check the level of logger?

IMO maybe it's better to move the calculation part out of this function and pass them as a dictionary in here. I think when calling a logging function, the computation effort is usally neglected. The user should be aware what calculation is actually done.

Good point! I was focusing on gathering all data that we'd need to debug a user's problem, I didn't consider performance enough.

rogerkuou · 2024-12-19T11:16:13Z

src/segmentmytiff/utils/datasets.py

@@ -19,7 +20,7 @@ def image_path_to_mask_path(image_path: Path) -> Path:
        self.masks = [str(image_path_to_mask_path(Path(p))) for p in self.images][:self.limit]
        non_existing_masks = [p for p in self.masks if Path(p).exists() == False]
        if non_existing_masks:
-            print(f"{len(non_existing_masks)} of a total of {len(self.masks)} masks not found.")
+            logging.getLogger().warning(f"{len(non_existing_masks)} of a total of {len(self.masks)} masks not found.")


depends on how we would like this function envolve, maybe consider initiate a logger to collect all the logs? Like you did in other modules.

good catch! thanks!

cwmeijer added 4 commits December 17, 2024 16:49

add basic logging

66b2654

add logging to files

f5185ce

add logging of start parameters

f1f07bc

add logging of input and predictions

2ad719f

cwmeijer marked this pull request as ready for review December 18, 2024 15:09

cwmeijer requested a review from rogerkuou December 18, 2024 15:27

cwmeijer added 2 commits December 18, 2024 16:33

tidy up logging of training data

7ef252d

add readme entry for the logging

3e9891d

rogerkuou requested changes Dec 19, 2024

View reviewed changes

remove slow array logging and improve variable name

f431418

cwmeijer merged commit 8689614 into main Dec 19, 2024
11 checks passed

cwmeijer deleted the 34-add-propper-logging branch December 19, 2024 15:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

34 add propper logging #36

34 add propper logging #36

cwmeijer commented Dec 18, 2024 •

edited

Loading

rogerkuou left a comment

rogerkuou Dec 19, 2024

cwmeijer Dec 19, 2024

rogerkuou Dec 19, 2024

rogerkuou Dec 19, 2024

cwmeijer Dec 19, 2024

rogerkuou Dec 19, 2024

cwmeijer Dec 19, 2024

rogerkuou Dec 19, 2024

cwmeijer Dec 19, 2024

		import pandas as pd


		def setup_logger(name: str = None, level=logging.INFO):

	root_logger.setLevel(logging.DEBUG)
	root_logger.setLevel(level)

		logger.info(f"{task_name} finished in {duration:.4f} seconds")


		def log_array(data: np.ndarray, logger, array_name:str="array") -> None:

34 add propper logging #36

34 add propper logging #36

Conversation

cwmeijer commented Dec 18, 2024 • edited Loading

rogerkuou left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cwmeijer commented Dec 18, 2024 •

edited

Loading