vrr/nDPI

mirror of https://github.com/vel21ripn/nDPI.git synced 2026-05-04 09:50:16 +00:00

Author	SHA1	Message	Date
Ivan Nardi	a7c2734b38	Remove classification "by-ip" from protocol stack (#1743) Basically: * "classification by-ip" (i.e. `flow->guessed_protocol_id_by_ip`) is NEVER returned in the protocol stack (i.e. `flow->detected_protocol_stack[]`); * if the application is interested in such information, it can access `ndpi_protocol->protocol_by_ip` itself. There are mainly 4 points in the code that set the "classification by-ip" in the protocol stack: the generic `ndpi_set_detected_protocol()`/`ndpi_detection_giveup()` functions and the HTTP/STUN dissectors. In the unit tests output, a print about `ndpi_protocol->protocol_by_ip` has been added for each flow: the huge diff of this commit is mainly due to that. Strictly speaking, this change is NOT an API/ABI breakage, but there are important differences in the classification results. For example: * TLS flows without the initial handshake (or without a matching SNI/certificate) are simply classified as `TLS`; * similar for HTTP or QUIC flows; * DNS flows without a matching request domain are simply classified as `DNS`; we don't have `DNS/Google` anymore just because the server is 8.8.8.8 (that was an outrageous behaviour...); * flows previously classified only "by-ip" are now classified as `NDPI_PROTOCOL_UNKNOWN`. See #1425 for other examples of why adding the "classification by-ip" in the protocol stack is a bad idea. Please, note that IPv6 is not supported :( (long standing issue in nDPI) i.e. `ndpi_protocol->protocol_by_ip` will always be `NDPI_PROTOCOL_UNKNOWN` for IPv6 flows. The `NDPI_CONFIDENCE_MATCH_BY_IP` define has been removed. Close #1687	2022-09-20 22:24:47 +02:00
Ivan Nardi	4f584f78a0	Fix `ndpi_do_guess()` (#1731) Avoid a double call of `ndpi_guess_host_protocol_id()`. Some code paths work for both IPv4 and IPv6. Remove some never-used code.	2022-09-12 19:28:41 +02:00
Ivan Nardi	0a47f745cc	Avoid useless host automa lookup (#1724 ) The host automa is used for two tasks: * protocol sub-classification (obviously); * DGA evaluation: the idea is that if a domain is present in this automa, it can't be a DGA, regardless of its format/name. In most dissectors both checks are executed, i.e. the code is something like: ``` ndpi_match_host_subprotocol(..., flow->host_server_name, ...); ndpi_check_dga_name(..., flow->host_server_name,...); ``` In that common case, we can perform only one automa lookup: if we check the sub-classification before the DGA, we can avoid the second lookup in the DGA function itself.	2022-09-05 13:59:51 +02:00
Ivan Nardi	405a52ed65	Patricia tree, Aho-Corasick automa, LRU cache: add statistics (#1683) Add (basic) internal stats to the main data structures used by the library; they might be useful to check how effective these structures are. Add an option to `ndpiReader` to dump them; enabled by default in the unit tests. This new option enables/disables dumping of "num dissectors calls" values, too (see `b4cb14ec`).	2022-07-29 15:25:00 +02:00
Ivan Nardi	b4cb14ec19	Keep track of how many dissectors calls we made for each flow (#1657)	2022-07-11 09:47:47 +02:00
Luca Deri	ab09b8ce2e	Added unidirectional traffic flow risk	2022-06-20 00:22:13 +02:00
Ivan Nardi	3a087e951d	Add a "confidence" field about the reliability of the classification. (#1395) As a general rule, the higher the confidence value, the higher the "reliability/precision" of the classification. In other words, this new field provides a hint about "how" the flow classification has been obtained. For example, the application may want to ignore classification "by-port" (they are not real DPI classifications, after all) or give a second glance at flows classified via LRU caches (because of false positives). Setting only one value for the confidence field is a bit tricky: more work is probably needed in the near future to tweak/fix/improve the logic.	2022-01-11 15:23:39 +01:00
Ivan Nardi	b1e9245d94	ndpiReader: slight simplification of the output (#1378)	2021-11-27 17:32:23 +01:00
Nardi Ivan	03d3e1bafc	Fix parsing of ipv6 packets with extension headers. Decoding of ipv6 traffic with extension headers was completely broken, since the beginning of the L4 header was always set to a wrong value. Handle the ipv6 fragments in the same way as the ipv4 ones: keep the first one and drop the others.	2021-09-19 17:29:22 +02:00
Luca Deri	e8455236bd	Updated output	2021-08-07 17:38:33 +02:00
Ivan Nardi	cccf794265	ndpiReader: add statistics about nDPI performance (#1240) The goal is to have a (rough) idea of how many packets nDPI needs to properly classify a flow. Log this information (and the number of guessed flows, too) during unit tests, to keep track of improvements/regressions across commits.	2021-07-13 12:28:39 +02:00
Luca Deri	23a15bae5f	Fixes #1029	2020-11-27 18:51:56 +01:00
Zied Aouini	43c1f6a3fd	CAPWAP tunnel decoding fix (#1038) * Fix CAPWAP processing. * Update result.	2020-10-21 15:07:20 +02:00
Luca Deri	dd75060932	Fixed false positive in suspicious user agent. Optimized stddev calculation	2020-08-30 12:25:15 +02:00
Luca Deri	e71df49b3e	Changed due to bin size extension	2020-07-30 00:06:46 +02:00
Luca Deri	12abcd516b	Updated test results due to bin changes	2020-07-09 17:28:02 +02:00
Luca Deri	1a62f4c799	Added ndpi_bin_XXX API. Added packet length distribution bins	2020-06-22 01:02:54 +02:00
Nardi Ivan	f965983c23	Add basic support for some ip-in-ip tunnels. Add support for 4in4, 6in6 and 4in6 encapsulations. Add support for ipv6 traffic in gtp tunnels, too. To allow gtp unit test, gtp detunneling flag has been globally enabled in the test suite	2020-04-23 10:55:33 +02:00
emanuele-f	fd94270507	Remove decimals in test results for IAT, packet lengths and goodput ratio	2020-02-14 11:42:20 +01:00
Luca Deri	0703ab5ac5	Improved DNS response decoding. The first decoded address is now reported by ndpiReader	2020-02-04 22:16:54 +01:00
Luca Deri	e98b994a39	Updated results	2019-11-21 13:35:04 +01:00
Luca Deri	fd38b752c4	Added capwap support	2019-10-27 19:03:23 +01:00

22 commits