vrr/nDPI

mirror of https://github.com/vel21ripn/nDPI.git synced 2026-04-28 23:19:42 +00:00

Author	SHA1	Message	Date
Luca Deri	901e317422	Added --cfg "tls,max_num_blocks_to_analyze,X" for dynamically setting TLS blocks number (#3073 ) * Added --cfg "tls,max_num_blocks_to_analyze,X" where if X > 0 TLS blocks are analyzed Example --cfg "tls,max_num_blocks_to_analyze,8" * TLS blocks now include a time-delta (msec) with respect to the previous TLS block. The format is @<msec delta>. Example: "tls_blocks": [ "22:1=232@191", "22:2=-122@5,20=-1@5,21=-23@5,21=-905@5,21=-281@5", "21=-53@0", "20=1@3,21=53@3", "21=-218@119,21=-218@119", ]	2026-01-08 23:36:13 +01:00
Luca Deri	412c63df19	Enhanced TLS blocks computation and included in nDPI Fingerprint (#3071 ) * Added TLS blocks serialization "tls_blocks": [ "22:1=496", "22:2=-122,20=-1,21=-27,21=-871,21=-281,21=-53", "20=1" ] How to read it - TLS records belonging to the same (reassembled) packet go on the same line - The format is <record type>=<record len> - The record length is positive if sent cli->srv, negative otherwise - In order to keep the SNI length (present in ClientHello) from influencing the length, the ClientHello record length does not include the SNI length (if SNI is present) * TLS blocks are now reported in numerical form Extended TLS blocks analysis to blocks other than client/server hello nDPI fingerprint now includes initial TLS blocks Added padding (RFC 7685) in the list of TLS ephemeral extensions	2026-01-04 23:15:08 +01:00
Luca Deri	37ca034697	(C) update	2026-01-01 10:31:40 +01:00
Luca	7d00f37528	Removed unnecessary serialization	2025-12-29 18:48:33 +01:00
Luca Deri	612c1d2264	tls_blocks in JSON are now symbolic	2025-12-27 21:04:59 +01:00
Luca Deri	8b7e588e42	Enhanced TLS Blocks Computation (#3068 )	2025-12-27 20:43:59 +01:00
Luca Deri	e49fa91627	Added tls_blocks serialization in JSON/csv Use --cfg "tls,blocks_analysis,1" with ndpiReader	2025-12-26 21:06:19 +01:00
Luca Deri	5a0df66a45	Exported bins in JSON/csv	2025-12-26 19:53:03 +01:00
Luca Deri	159c05f032	Added ability to export SSH key exchanges (disabled by default). It's possible to enable it using "--cfg=ssh,metadata.ssh_data,1" in ndpiReader. When enabled the negotiated SSH key exchange method is returned.	2025-12-20 20:19:17 +01:00
Luca Deri	3f2f1f8ce4	Added ability to define protocol dissectors in shared libraries (#3047 ) * Added ability to define protocol dissectors in shared libraries and load them at runtime --------- Co-authored-by: Ivan Nardi <nardi.ivan@gmail.com>	2025-12-04 15:26:15 +01:00
Ivan Nardi	b762509177	S7Comm: follow-up to complete monitoring feature (#3045 )	2025-11-28 18:11:24 +01:00
Ivan Nardi	19ee4f6c33	Build system: minor fixes about flag compilation and example dependencies (#3038 ) - always use `-Wextra` compilation flag; it was already used in CI - always compile `ndpiSimpleIntegration` when building examples - don't mess with optimization flags: `CFLAGS` default value is "-g -O2" and the user can change it Try to test -O1,2,3,s flags in CI. Fix some warnings.	2025-11-21 15:51:29 +01:00
Luca Deri	5c327aafa0	Added nDPI Configuration Export (#3022 ) * In order to reduce ndpi_main.c file size: - Removed nDPI configuration code from ndpi_main.c and placed into ndpi_config.c - Moved some utils functions from ndpi_main.c to ndpi_utils.c * Added - ndpi_dump_host_based_protocol_id() - ndpi_dump_host_based_category_id() to enable users to dump protocolId and categoryId of host-based protocols ndpiReader - Added --protos-dump <mode> | Dump host-based protocolId (mode=1) and categoryId (mode=2)	2025-11-09 19:39:47 +01:00
Ivan Nardi	e22a434709	Rework API to set custom memory allocator functions (#3023 ) Full accounting of memory used by the library. Change `ndpi_realloc()` prototype to be compatible with standard `realloc()`. Be compatible with croaring allocation logic. Note that aligned allocations are used only by croaring code. Note that flow allocations are used only by the application, not by the library. API changes: * remove `set_ndpi_malloc()` and `set_ndpi_free()`; use `ndpi_set_memory_alloction_functions()` instead	2025-11-09 13:11:55 +01:00
Ivan Nardi	433f708951	Fix compilation when using external libgcrypt (#3018 ) ndpiReader: fix encodeDomainsUnitTest test	2025-11-04 10:41:00 +01:00
Ivan Nardi	a9e38cc504	ndpiReader: fix typo Credits to @s4n-cz. Close #3015	2025-11-03 12:36:12 +01:00
Ivan Nardi	83d85775a8	Provide an explicit state for the flow classification process (#2942 ) Application should keep calling nDPI until flow state becomes `NDPI_STATE_CLASSIFIED`. The main loop in the application is simplified to something like: ``` res = ndpi_detection_process_packet(...); if(res->state == NDPI_STATE_CLASSIFIED) { /* Done: you can get final classification and all metadata. nDPI doesn't need more packets for this flow */ } else { /* nDPI needs more packets for this flow. The provided classification is not final and more metadata might be extracted. If `res->state` is `NDPI_STATE_PARTIAL`, partial/initial classification is available in `res->proto` as usual but it can be updated later. */ } /* Example A (QUIC flow): pkt 1: proto QUIC state NDPI_STATE_PARTIAL pkt 2: proto QUIC/Youtube state NDPI_STATE_CLASSIFIED Example B (GoogleMeet call): pkt 1: proto STUN state NDPI_STATE_PARTIAL pkt N: proto DTLS state NDPI_STATE_PARTIAL pkt N+M: proto DTLS/GoogleCall state NDPI_STATE_CLASSIFIED Example C (standard TLS flow): pkt 1: proto Unknown state NDPI_STATE_INSPECTING pkt 2: proto Unknown state NDPI_STATE_INSPECTING pkt 3: proto Unknown state NDPI_STATE_INSPECTING pkt 4: proto TLS/Facebook state NDPI_STATE_PARTIAL pkt N: proto TLS/Facebook state NDPI_STATE_CLASSIFIED */ ``` You can take a look at `ndpiReader` for a slightly more complex example. API changes: * remove the third parameter from `ndpi_detection_giveup()`. If you need to know if the classification flow has been guessed, you can access `flow->protocol_was_guessed` * remove `ndpi_extra_dissection_possible()` * change some prototypes from accepting `ndpi_protocol foo` to `ndpi_master_app_protocol bar`. The update is trivial: from `foo` to `foo.proto`	2025-11-03 12:08:15 +01:00
Luca Deri	e9751cec26	Added TLS Block Analysis (#3016 ) * Enabled TLS block analysis via --cfg=tls,blocks_analysis,1 * Added comment and optimization * Updated output format * Code cleanup	2025-10-27 10:21:26 +01:00
Ivan Nardi	20892cf4fc	Extend values saved in hash data structure to `u_int64_t` (#3013 ) Move from `u_int32_t` to `u_int64_t`. We want to be able to save protocol + category + breed in the same entry.	2025-10-24 17:58:08 +02:00
Ivan Nardi	01836e0071	Proper handling of internal/external ids in FPC; fix FPC with custom rules (#3007 )	2025-10-22 21:28:12 +02:00
Ivan Nardi	faca0a6565	ndpiReader: improve statistics	2025-10-22 20:34:29 +02:00
Luca Deri	79b74115d2	Fixes invalid initialization that caused the two commands below to return different results ./example/ndpiReader -t -i ./tests/pcap/bets.pcapng -L ./lists/public_suffix_list.dat -G ./lists/ ./example/ndpiReader -t -i ./tests/pcap/bets.pcapng -G ./lists/	2025-10-21 15:10:28 +02:00
Luca Deri	735e0df40c	Updated test	2025-10-18 00:22:14 +02:00
Ivan Nardi	9d22805954	Add statistics about hash data structures (#2995 )	2025-10-17 20:39:15 +02:00
Ivan Nardi	a9cc75d634	ndpiReader: fix memory accounting (#2988 ) We don't know how much memory we are currently using: we only know the amount of total memory allocated. Use proper label to report this information in a correct way	2025-10-12 18:12:01 +02:00
Ivan Nardi	a07d55005d	fuzz: try to improve fuzzing coverage (#2981 )	2025-10-06 20:44:31 +02:00
Ivan Nardi	3a06d2037f	ndpiReader: create a wrapper to configure nDPI (local) context (#2979 ) Use it to better test domains, too	2025-10-05 11:39:46 +02:00
Ivan Nardi	8ad62d7e7f	ndpiReader: quick test for a list of domains (#2978 )	2025-10-03 20:06:51 +02:00
Ivan Nardi	5aaab7f354	Fix `ndpi_is_valid_hostname()` (#2974 ) It was completely broken. Pay some attention to HTTP case where we might have Host header in the "$DOMAIN:$PORT" form: we usually want to strip the port part `memrchr` is not available on macOS and on Windows: create a wrapper	2025-09-29 12:27:21 +02:00
Luca Deri	15f8dad9e8	Modified ndpi_ranking_add_epoch() API	2025-09-27 22:16:25 +02:00
Ivan Nardi	ddd277fc44	HTTP: add further configuration to enable/disable metadata extraction (#2972 ) Rename existing configuration knobs, to better separate metadata from requests, from metadata from responses	2025-09-23 15:11:25 +02:00
Ivan Nardi	1c1535738f	ndpiReader: ranking unit tests: disable logging	2025-09-23 14:38:25 +02:00
Luca	52ce501355	Improved ndpi_ranking calculation for - keeping track of the number of updates without rank changes - not creating new slots (but overwriting the last one) when a new update with no rank changes is computed. This way in the ranking data structure there are only entries that caused ranking changes	2025-09-17 19:45:04 +02:00
Ivan Nardi	6a3228388b	ndpiReader: improve debug option '-x' to test category matches	2025-09-05 19:58:25 +02:00
Luca Deri	52d4607bbd	Extended ndpi_ranking_add_epoch() API	2025-09-05 07:33:08 +02:00
Ivan Nardi	efccc7d5e4	Rework flow breed (#2926 ) Right now, there is, in essence, a static mapping between flow protocols and flow breeds. Make it dynamic: allow different flows with the same classification to have different breeds. This is the same logic that we already have for categories. Preliminary work to support breed in category lists. API change from the app POV: to get the flow breed, no longer use `ndpi_get_proto_breed()`; access `struct ndpi_proto->breed` directly instead. The functions `ndpi_domain_classify_*()` and `ndpi_get_host_domain_suffix()` now have a `u_int32_t` parameter as `class_id` (instead of `u_int16_t`), with the following logic: ``` class_id = (breed << 16) | category ``` instead of the old: ``` class_id = category ``` Please note that this change is backward-compatible: if you are not interested in breeds, you don't need to update the application code.	2025-09-02 16:54:34 +02:00
Ivan Nardi	1da8b85ee7	Fix compilation and unit tests (#2948 ) ``` ndpi_analyze.c: In function ‘ndpi_deserialize_ranking’: ndpi_analyze.c:2244:3: warning: ignoring return value of ‘fread’ declared with attribute ‘warn_unused_result’ [-Wunused-result] 2244 | fread(&rank->header, sizeof(ndpi_ranking_header), 1, fd); | ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ndpi_main.c: In function ‘ndpi_match_host_subprotocol’: ndpi_main.c:11798:9: warning: ‘__builtin_strncpy’ output may be truncated copying between 0 and 63 bytes from a string of length 255 [-Wstringop-truncation] 11798 | strncpy(str, string_to_match, ndpi_min(string_to_match_len, sizeof(str)-1)); | ^ ndpi_main.c:11811:7: warning: ‘__builtin_strncpy’ output may be truncated copying between 0 and 63 bytes from a string of length 255 [-Wstringop-truncation] 11811 | strncpy(str, string_to_match, ndpi_min(string_to_match_len, sizeof(str)-1)); ```	2025-08-30 21:05:40 +02:00
Luca Deri	a6e2b4e252	Initial (WiP/basic) implementation of the ranking detection API used to determine rank changes void ndpi_init_ranking(ndpi_ranking *rank, u_int16_t max_num_items, u_int16_t num_epochs); void ndpi_term_ranking(ndpi_ranking *rank); bool ndpi_serialize_ranking(ndpi_ranking *rank, const char *path); bool ndpi_deserialize_ranking(ndpi_ranking *rank, const char *path); void ndpi_print_ranking(ndpi_ranking *rank); u_int16_t ndpi_ranking_add_epoch(ndpi_ranking *rank, u_int32_t epoch, ndpi_ranking_epoch_entry *entries, u_int16_t num_epoch_entries, ndpi_ranking_change *changes /* Out */);	2025-08-28 16:26:44 +02:00
Luca Deri	11d74ea286	Implemented nDPI fingerprint that is computed using - TCP fingerprint - JA4 fingerprint - TLS SHA1 certificate (if present), or JA3S fingerprint (if SHA1 is missing) By default the fingerprint uses the client and server fingerprints (format 0) and combines them. However you can change its format (e.g. use only the client info, format 1) with --cfg NULL,metadata.ndpi_fingerprint_format,X where X is the fingerprint format. By default nDPI fingerprint is enabled but you can enable/disable it as follows --cfg NULL,metadata.ndpi_fingerprint,0	2025-08-21 10:34:49 +02:00
fanxb	7a2ca82c9d	ndpiReader: Fix the crash issue during protocol guessing in multi-core scenarios. (#2939 )	2025-08-08 11:58:17 +02:00
Ivan Nardi	8dd2220116	Add the concept of protocols stack: more than 2 protocols per flow (#2913 ) The idea is to remove the limitation of only two protocols ("master" and "app") in the flow classification. This is quite handy especially for STUN flows and, in general, for any flows where there is some kind of transition from a cleartext protocol to TLS: HTTP_PROXY -> TLS/Youtube; SMTP -> SMTPS (via STARTTLS msg). In the vast majority of the cases, the protocol stack is simply Master/Application. Examples of real stacks (from the unit tests) different from the standard "master/app": * "STUN.WhatsAppCall.SRTP": a WA call * "STUN.DTLS.GoogleCall": a Meet call * "Telegram.STUN.DTLS.TelegramVoip": a Telegram call * "SMTP.SMTPS.Google": an SMTP connection to Google server started in cleartext and updated to TLS * "HTTP.Google.ntop": an HTTP connection to a Google domain (match via "Host" header) and to an ntop server (match via "Server" header) The logic to create the stack is still a bit coarse: we have a decade of code trying to push everything into only two protocols... Therefore, the content of the stack is still highly experimental and might change in the near future; do you have any suggestions? It is quite likely that the legacy fields "master_protocol" and "app_protocol" will be there for a long time. Add some helpers to use the stack: ``` ndpi_stack_get_upper_proto(); ndpi_stack_get_lower_proto(); bool ndpi_stack_contains(struct ndpi_proto_stack *s, u_int16_t proto_id); bool ndpi_stack_is_tls_like(struct ndpi_proto_stack *s); bool ndpi_stack_is_http_like(struct ndpi_proto_stack *s); ``` Be sure new stack logic is compatible with legacy code: ``` assert(ndpi_stack_get_upper_proto(&flow->detected_protocol.protocol_stack) == ndpi_get_upper_proto(flow->detected_protocol)); assert(ndpi_stack_get_lower_proto(&flow->detected_protocol.protocol_stack) == ndpi_get_lower_proto(flow->detected_protocol)); ```	2025-08-01 10:05:50 +02:00
Ivan Nardi	44b9a2da81	ndpiReader: add breed to flow information (#2924 )	2025-07-30 18:46:28 +02:00
Luca Deri	8f661f9aa3	Cosmetic changes	2025-07-18 21:46:43 +02:00
Fábio Depin	4eff2cdb99	Refactor: make src_name/dst_name dynamically allocated to reduce RAM usage (#2908 ) - Changed ndpi_flow_info: replaced fixed-size char arrays (always INET6_ADDRSTRLEN) for src_name and dst_name with char* pointers. - Now IPv4 flows use only INET_ADDRSTRLEN when needed, instead of always reserving IPv6 size.	2025-07-02 07:41:55 +02:00
Fábio Depin	8987a2c184	Fix logic: reset stats once per thread after clearing all flow roots (#2905 ) Call ndpi_stats_reset() once per thread instead of once per flow root Moved ndpi_stats_reset() outside the loop that destroys ndpi_flows_root[] to avoid redundant resets. The stats structure is shared per thread and should only be reset once after all roots are cleared.	2025-06-24 15:07:20 +02:00
Fábio Depin	c2526cffc1	Fix stats memory reuse and cleanup across duration loops in ndpiReader (#2903 ) (#2904 ) Refactored stats allocation and reset logic to avoid segmentation faults when running ndpiReader in live_capture mode with the -m (duration) option. - Introduced ndpi_stats_init(), ndpi_stats_reset(), and ndpi_stats_free() to encapsulate lifecycle management of stats. - Applied these functions in ndpiReader.c and reader_util.{c,h}. - Prevented multiple allocations and ensured safe reuse of cumulative_stats and per-thread stats structures between capture iterations. Fixes: https://github.com/ntop/nDPI/issues/2903	2025-06-24 09:48:34 +02:00
Ivan Nardi	978ca1ba1a	New API to enable/disable protocols. Removed `NDPI_LAST_IMPLEMENTED_PROTOCOL` (#2894 ) Change the API to enable/disable protocols: you can set that via the standard `ndpi_set_config()` function, like any other configuration parameter. By default, all protocols are enabled. Split the (local) context initialization into two phases: * `ndpi_init_detection_module()`: generic part. It does not depend on the configuration and on the protocols being enabled or not. It also calculates the real number of internal protocols * `ndpi_finalize_initialization()`: apply the configuration. All the initialization that depends on protocols being enabled or not must be put here This is the last step to have the protocols number fully calculated at runtime Remove a (now) useless fuzzer. Important API changes: * remove `NDPI_LAST_IMPLEMENTED_PROTOCOL` define * remove `ndpi_get_num_internal_protocols()`. To get the number of configured protocols (internal and custom) you must use `ndpi_get_num_protocols()` after having called `ndpi_finalize_initialization()`	2025-06-23 11:24:18 +02:00
Ivan Nardi	6cbc8d1471	fuzz: fuzz loading of external protocols lists (#2897 )	2025-06-22 20:43:16 +02:00
Ivan Nardi	aa6dcad15e	ndpiReader: print categories summary (#2895 )	2025-06-21 12:41:00 +02:00
Luca Deri	3a243bb40d	Merged protocols (now free to use) into existing categories - AdultContent -> Category Adult Content - LLM -> Category Artificial Intelligence	2025-06-17 23:57:15 +02:00
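
Commit efccc7d5e4 in the log above describes the new `class_id` layout as `(breed << 16) | category`. A minimal sketch of packing and unpacking such a value is given here as an illustration only. The helper names `pack_class_id`, `class_id_breed` and `class_id_category` are hypothetical and are not part of the nDPI API, and the standard `<stdint.h>` types stand in for nDPI's `u_int32_t`/`u_int16_t`.

```c
#include <stdint.h>

/* Hypothetical helper (not part of nDPI): combine a breed and a category
   into one 32-bit value, following class_id = (breed << 16) | category. */
static uint32_t pack_class_id(uint16_t breed, uint16_t category) {
  return ((uint32_t)breed << 16) | (uint32_t)category;
}

/* Hypothetical helper: extract the breed from the upper 16 bits. */
static uint16_t class_id_breed(uint32_t class_id) {
  return (uint16_t)(class_id >> 16);
}

/* Hypothetical helper: extract the category from the lower 16 bits. */
static uint16_t class_id_category(uint32_t class_id) {
  return (uint16_t)(class_id & 0xFFFFu);
}
```

With a breed of 0 the packed value is numerically equal to the category alone, which matches the old `class_id = category` layout mentioned in that commit.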


748 commits
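
Commits 412c63df19 and 901e317422 in the log above describe `tls_blocks` entries of the form `<record type>=<record len>`, optionally followed by `@<msec delta>`. Below is a rough sketch of how a consumer might parse one such token. It is not code from nDPI: the struct and function names are hypothetical, and treating the number after `:` (as in `22:1`) as an optional sub-type is an assumption, since the log does not define that field.

```c
#include <stdlib.h>

/* Hypothetical container for one parsed token (not an nDPI type). */
struct tls_block {
  long record_type; /* e.g. 22 */
  long sub_type;    /* number after ':' (meaning assumed), -1 when absent */
  long length;      /* positive: cli->srv, negative: srv->cli */
  long delta_msec;  /* number after '@', -1 when absent */
};

/* Parse one token such as "22:1=232@191" or "21=-53".
   Parsing stops at the end of the string or at a ',' separator.
   Returns 0 on success, -1 on malformed input. */
static int parse_tls_block(const char *s, struct tls_block *out) {
  char *end;

  out->sub_type = -1;
  out->delta_msec = -1;

  out->record_type = strtol(s, &end, 10);
  if(end == s) return -1;
  s = end;

  if(*s == ':') { /* optional sub-type */
    s++;
    out->sub_type = strtol(s, &end, 10);
    if(end == s) return -1;
    s = end;
  }

  if(*s != '=') return -1; /* the length is mandatory */
  s++;
  out->length = strtol(s, &end, 10);
  if(end == s) return -1;
  s = end;

  if(*s == '@') { /* optional time delta in msec */
    s++;
    out->delta_msec = strtol(s, &end, 10);
    if(end == s) return -1;
    s = end;
  }

  return (*s == '\0' || *s == ',') ? 0 : -1;
}
```

A line holding several records, such as `"20=1@3,21=53@3"`, could be handled by calling the function again on the text that follows each comma.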